This data was first compiled for use in the following papers (please acknowledge at least one of them if you use the data in a publication):
- Maximum entropy models and subjective interestingness: an application to tiles in binary databases (Data Mining and Knowledge Discovery, 2011)
- A framework for mining interesting patterns (Useful Patterns workshop and SIGKDD Explorations, 2010)
- An information-theoretic approach to finding informative noisy tiles in binary databases (SDM proceedings, 2010)
- Finding interesting itemsets using a probabilistic model for binary databases (University of Bristol Technical Report, 2009)
- Explicit probabilistic models for databases and networks (University of Bristol Technical Report, 2009)

Another relevant paper is:
- An information theoretic framework for data mining (KDD proceedings, 2011)

| Attachment | Size |
|---|---|
| ICDM.zip | 561.48 KB |
| KDD.zip | 711.1 KB |
| PUBMED.zip | 1.74 MB |
