| sam | Linux executable | (193 kb) |
| sam.exe | Windows console executable | (200 kb) |
| sam.zip | C sources, version 2.56, 2012.01.20 | (161 kb) |
| sam.tar.gz | (145 kb) | |
| census.zip | census data set (from the UCI ML repository) | (390 kb) |
| census | shell script used for the conversion | (1 kb) |
A program to find frequent item sets (also closed and maximal) with a split and merge algorithm (Split and Merge), which combines a depth-first traversal of the subset lattice with a horizontal transaction representation. The main strength of this algorithm is not its speed (although it is not slow, but even outperforms apriori and eclat on some data sets), but the simplicity of its structure. Basically all the work is done in one recursive function of about fairly few lines of code. In addition, it only uses a simple array as the only data structure.
Papers that describe this algorithm:
More information about frequent item set mining, implementations of other algorithms as well as test data sets can be found at the Frequent Itemset Mining Implementations Repository.