k-mean clustering and its real usecase in the security domain

Pratikkorgaonkar
2 min readAug 12, 2021

--

In this article you are going to known about what is k-mean clustering , where you can use it and it’s industry usecase in security domain.

When trying to analyze data, one approach might be to look for meaningful groups or clusters. Clustering is dividing data into groups based on similarity. And K-means is one of the most commonly used methods in clustering. It is because it’s simplicity.

How does k-mean clustering works?

Let’s say we’d like to divide the following points into clusters.

step1:choose k value for ex: k=2

step2:initialize centroids randomly

step3:calculate Euclidean distance from centroids to each data point and form clusters that are close to centroids

step4: find the centroid of each cluster and update centroids

step:5 repeat step3

Each time clusters are made centroids are updated, the updated centroid is the center of all points which fall in the cluster. This process continues till the centroid no longer changes i.e solution converges.

How To Choose K Value In K-Means:

1.Elbow method

steps:

step1: compute clustering algorithm for different values of k.

for example k=[1,2,3,4,5,6,7,8,9,10]

step2: for each k calculate the within-cluster sum of squares(WCSS).

step3: plot curve of WCSS according to the number of clusters.

step4: The location of bend in the plot is generally considered an indicator of the approximate number of clusters.

We can also use this concepts of kmeans clustering in cyber security .for example

  • Customer Profiling
  • Market segmentation
  • Computer vision
  • Geo-statistics
  • Astronomy
  • Document clustering
  • Identifying crime-prone areas
  • Cluster analysis
  • Feature learning or dictionary learning
  • Identifying crime-prone areas

Thank you for reading this article!!!!!!!

--

--