Hierarchichal clustering

Imagine sorting a basket of fruits. Clustering groups similar fruits based on features like color, size, and shape.

[{"selector":"#anim-d8b9ad52-2912-430c-a46f-4daf106d692a","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b187c53f-f085-4f47-97a8-05c81e4fad02","keyframes":{"transform":["translate3d(0px, 170.33976%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-cacef6c7-c8c7-4b9e-a9b8-7dc329b92ae6","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-0038781a-5f45-44ca-a2fb-4b86efab245c","keyframes":{"transform":["translate3d(0px, -240.27785%, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}]

Hierarchical clustering starts with individual data points and progressively merges them into clusters based on similarity.

[{"selector":"#anim-5631380c-e93b-4b1e-952e-2d7815a35442","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-11f5aa5b-9dbb-4f17-be14-3bf318553202","keyframes":{"transform":["translate3d(0px, 156.60227%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-eed7b35c-3921-4486-a9a8-26a6500fc623","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-4860958c-97c5-484a-a689-7cbf84a895fc","keyframes":{"transform":["translate3d(0px, 227.94594%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-22155bfa-651e-46c9-993b-5bd9e7b97041","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-194781f8-49e5-462f-ac8a-2dad8ccb3c3e","keyframes":{"transform":["translate3d(0px, -367.19569%, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] Read More

– Agglomerative: Starts with single points. – Divisive: Starts with all data in one cluster.

[{"selector":"#anim-6faac8f0-9aa0-40b0-95dc-ab1e837b42d5","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-90812b64-6e8c-48f6-81f9-dab272b1c713","keyframes":{"transform":["translate3d(0px, 206.80489%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9beda1bd-4e00-452e-822a-8c5ddfd2a518","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-ff489e2e-6c36-4498-a631-e1d53f024e82","keyframes":{"transform":["translate3d(0px, 227.94594%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-a458a574-5505-4849-a34a-00b7efe70546","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-4b14d21d-145e-403b-aa5c-6cce79e118fc","keyframes":{"transform":["translate3d(0px, -240.27785%, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] Read More

To measure similarity, we use distance metrics like Euclidean distance (straight line) or Manhattan distance (taxicab geometry).

[{"selector":"#anim-22997d91-bb22-4fd4-8a58-1e8c099318cb","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-f697a91d-0e0d-45fb-8e96-d9c0fa92fe41","keyframes":{"transform":["translate3d(0px, 162.03706%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b424c9e8-5891-43c4-bf64-4ea5f29ee54c","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-f61966c7-fc33-43d2-b259-ea33b0294def","keyframes":{"transform":["translate3d(0px, -240.27785%, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-265f866e-6d88-4ef2-876c-d6bdc6f4f94b","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-00b787f3-a24d-4781-986b-74ccab439ee9","keyframes":{"transform":["translate3d(0px, 227.94594%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] Read More

A dendrogram, a tree-like structure, shows the merging process and the relationships between clusters at different levels.

[{"selector":"#anim-1310c5d4-02f6-4e52-960f-ce251631a5f4","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-eb7ef7e5-f780-46b1-b2f6-3ee8cb9ef4d1","keyframes":{"transform":["translate3d(0px, 184.80256%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7dfb7e8b-07cc-4c62-aa20-1821d7eed11e","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-0283ba15-07f7-4304-a041-4da595501270","keyframes":{"transform":["translate3d(0px, -240.27785%, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}]

– Customer segmentation – Image analysis – Document clustering

[{"selector":"#anim-95d02222-3e61-49c1-9c7e-d14d706f90c0","keyframes":{"opacity":[0,1]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-df0301e6-526d-4074-8ece-c42271d2ed91","keyframes":{"transform":["translate3d(0px, 248.40357%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":900,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-6116977a-aed6-4759-87ba-f1cbf6510334","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b237ffdb-731e-4485-89c3-ec3d7515ae3e","keyframes":{"transform":["translate3d(0px, -259.02782%, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}]

Hierarchichal clustering

Imagine sorting a basket of fruits. Clustering groups similar fruits based on features like color, size, and shape.

What is Clustering?

Hierarchical clustering starts with individual data points and progressively merges them into clusters based on similarity.

The Hierarchy

– Agglomerative: Starts with single points. – Divisive: Starts with all data in one cluster.

Two Main Approaches

To measure similarity, we use distance metrics like Euclidean distance (straight line) or Manhattan distance (taxicab geometry).

Choosing the Right Distance

A dendrogram, a tree-like structure, shows the merging process and the relationships between clusters at different levels.

Visualizing the Hierarchy

– Customer segmentation – Image analysis – Document clustering

Real-World Applications