spark_ml

The purpose Of this project is to demonstrate the usage of spark mllib from training to predicting ecommercial product's category.

Training Stage:

Uses tfidf to vectorize text context
Uses Cliffisier Algorithms such as Naive Bayes, Ovr Logistic Regression, Random Forest to train the model

Predict Stage:

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
spark_libs		spark_libs
.gitignore		.gitignore
README.md		README.md
predict_category.py		predict_category.py
train_category.py		train_category.py
train_w2v.py		train_w2v.py
train_w2v_category.py		train_w2v_category.py

Provide feedback