Skip to content

ruchi09/Big-data

Repository files navigation

Big-data

This repository is a collection of codes implementing Big data concepts and algorithms, many of them being direct implementation of some well known research papers. This reposirory includes following:

  1. Classification based on Associations
  2. A-Close
  3. Improved Apriori implementation using hashing
  4. Improved Apriori implementation using partition based approach
  5. Improved Apriori implementation using transaction reduction
  6. CHARM
  7. Dynamic Itemset Counting
  8. Equivalence Class LAttice Traversal
  9. MAximal Frequent Itemset Analysis
  10. Pincer Search
  11. Pyspark programs: A collection of basic programs written with pyspark

The generate_itemsets.py can be used to generate many custom datasets for the given codes.

About

A collection of codes implementing Big data concepts and algirithms

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages