| apriori | Linux executable | (99 kb) | ||
| apriori.exe | Windows console executable | (132 kb) | ||
| apriori.zip | C sources, version 5.19, 2010.07.22 | (203 kb) | ||
| apriori.tar.gz | (178 kb) | |||
| census.zip | census data set (from the UCI ML repository) | (390 kb) | ||
| census | shell script used for the conversion | (1 kb) |
A program to find association rules and frequent item sets (also closed and maximal) with the apriori algorithm (Agrawal et al. 1993), which carries out a breadth first search on the subset lattice and determines the support of item sets by subset tests. This is a pretty fast implementation that uses a prefix tree to organize the counters for the item sets.
Full description of this program (included in the source package).
Attention: Please note that versions 3.x and 4.x differ from earlier versions in how they interpret the options -l, and -y. This is especially important for the option -l, which now means "do not load transactions into main memory", because using this option considerably reduces performance. Note also that since version 5.0 the default target type is sets rather than rules (in order to have a uniform default for all frequent item set mining programs).
Earlier versions of this program are incorporated in the data mining tool Clementine, available from SPSS (apriori version 1.8 in Clementine version 5.0, apriori version 2.7 in Clementine version 7.0, last version shipped to SPSS is 4.30).
A graphical user interface for this program (ARView), written in Java, is available here.
Another graphical user interface for this program, which is based on Gnome 2 and was developed by togaware, can be found here. Another graphical user interface, which is based on the wxWidgets and was developed by the STK++ team can be found here. (However, I cannot guarantee that these GUIs works with the latest version of the command line program made available here.)
Papers that describe some implementation aspects of this program:
Some other references:
More information about frequent item set mining, implementations of other algorithms as well as test data sets can be found at the Frequent Itemset Mining Implementations Repository.