Abstract
Constraint-based pattern discovery is at the core of numerous data mining tasks. Patterns are extracted with respect to a given set of constraints (frequency, closedness, size, etc). In practice, many constraints require threshold values whose choice is often arbitrary. This difficulty is even harder when several thresholds are required and have to be combined. Moreover, patterns barely missing a threshold will not be extracted even if they may be relevant. The paper advocates the introduction of softness into the pattern discovery process. By using Constraint Programming, we propose efficient methods to relax threshold constraints as well as constraints involved in patterns such as the top-k patterns and the skypatterns. We show the relevance and the efficiency of our approach through a case study in chemoinformatics for discovering toxicophores.
| Original language | English |
|---|---|
| Pages (from-to) | 193-221 |
| Number of pages | 29 |
| Journal | Journal of Intelligent Information Systems |
| Volume | 44 |
| Issue number | 2 |
| DOIs | |
| State | Published - Apr 2015 |
| Externally published | Yes |
Keywords
- Chemoinformatics
- Constraint Programming
- Constraint-based pattern mining
- Disjonctive relaxation
- Soft constraints
- Soft skypatterns
Fingerprint
Dive into the research topics of 'Soft constraints for pattern mining'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver