First, this is amazing! It opens my eyes on the unknown clusters of the github repo. But i wonder could you shed some light (or code) on the implementation? Thanks in advance.