Efficient Constrained K-center Clustering with Background Knowledge

Guo, Longkun, Jia, Chaoqi, Liao, Kewen, Lu, Zhigang, and Xue, Minhui (2024) Efficient Constrained K-center Clustering with Background Knowledge. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (38) pp. 20709-20717. From: AAAI 2024: 38th AAAI Conference on Artificial Intelligence, 20-27 February 2024, Vancouver, Canada.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: https://doi.org/10.1609/aaai.v38i18.3005...


Abstract

Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work, we build on widely adopted k-center clustering and model its input background knowledge as must-link (ML) and cannot-link (CL) constraint sets. However, most clustering problems including k-center are inherently NP-hard, while the more complex constrained variants are known to suffer severer approximation and computation barriers that significantly limit their applicability. By employing a suite of techniques including reverse dominating sets, linear programming (LP) integral polyhedron, and LP duality, we arrive at the first efficient approximation algorithm for constrained k-center with the best possible ratio of 2. We also construct competitive baseline algorithms and empirically evaluate our approximation algorithm against them on a variety of real datasets. The results validate our theoretical findings and demonstrate the great advantages of our algorithm in terms of clustering cost, clustering quality, and running time.

Item ID: 87297
Item Type: Conference Item (Research - E1)
ISSN: 2374-3468
Copyright Information: © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
Date Deposited: 12 Nov 2025 02:24
FoR Codes: 46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460299 Artificial intelligence not elsewhere classified @ 100%
SEO Codes: 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220403 Artificial intelligence @ 100%
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page