VelvetFlow: An engineering pipeline for robust multi-density clustering

Document Type : Full Length Article

Authors

Department of Computer Science, University of Tarbiat Modares, Tehran, I. R. Iran

Abstract

Problem. Real-world datasets seldom respect a single density scale: tight blobs, elongated ribbons, and isolated points often coexist. Classical algorithms such as DBSCAN or \textit{k}-means require domain-specific parameter tuning and provide only ad-hoc support for anomaly detection.
Solution. We introduce VelvetFlow, an engineering pipeline that turns a set of well-understood building blocks into a cohesive, end-to-end workflow for multi-density clustering \emph{and} principled outlier detection. The pipeline is composed of three reusable stages:
(i) \emph{Contextual-density splitting} assigns every point to a high- or low-density partition using a single neighbourhood size $k$.
(ii) \emph{Density-aware clustering} applies a Jaccard-guided \textit{FusedNeighbor}+DBSCAN routine to the sparse partition and HDBSCAN to the dense partition-without introducing new hyper-parameters.
(iii) \emph{Scaled-MST verification} re-examines the complete $k$-NN graph, flags weakly connected components, and validates them with a $k$-NN gate; this step recovers small remote clusters while filtering genuine anomalies.

Graphical Abstract

VelvetFlow: An engineering pipeline for robust multi-density clustering

Keywords

Main Subjects


Volume 10, Issue 4
December 2025
Pages 333-358
  • Receive Date: 10 May 2025
  • Revise Date: 06 July 2025
  • Accept Date: 09 August 2025
  • First Publish Date: 28 November 2025
  • Publish Date: 01 December 2025