Unveiling Clustering in BERTopic Topic Modeling | Abhiram Ravikumar & Jaspal Singh | Conf42 ML 2023
Read the abstract ➤ https://www.conf42.com/Machine_Learning_2023_Abhiram_Ravikumar_Jaspal_Singh_Jhass_data_to_discovery_clustering_bertopic_top Other sessions at this event ➤ https://www.conf42.com/ml2023 Join Discord ➤ https://discord.gg/DnyHgrC7jC Project ➤ https://github.com/abhi12ravi/BERTopic_Conf42 Chapters 0:00 intro 0:22 preface 0:36 who are we? 1:42 agenda 2:30 topic modeling use case 4:02 why bertopic? 6:47 bertopic end-to-end flow 7:36 clustering 8:33 dataset description 8:56 demo 13:07 what is hdbscan? 13:37 to understand hdbscan we need to know dbscan 15:39 what if there was no fixed radius? 15:54 k-nn algorithm to define radius 18:00 minimum spanning tree finds density and hierachy 19:49 density based spatial clustering 20:09 stability score "λ" 22:03 final clusters 22:32 hdbscan steps 23:05 hdbscan - performance comparison 23:58 hdbscan - strenghts and weaknesses 24:47 conclusion and future scope 26:28 references & ressources 26:34 thank you