This is a web service to find top review books and their key words from The-Book website.
Epoch: an epoch is defined by an api call, every 4 seconds
N: is a counter on epochs, where N=1 is the epoch (currentTime, currentTime - 4secs), N = 10 is the epoch (currentTime - 36secs, currentTime - 40secs)
X: is number of top review books in that epoch of query
3 endpoints:
-
Top review: enter the epoch time from now N (1-50) and number of top books X (1-50), it will return top X books with their review scores
api: http://ec2-18-216-190-42.us-east-2.compute.amazonaws.com:5002/topReview/N/X
-
Top similarity: enter the epoch time from now N (1-50) and number of top books X (1-50), it will return top X books and the similarity score among other top review books
api: http://ec2-18-216-190-42.us-east-2.compute.amazonaws.com:5002/topSimilarity/N/X
-
Top key words: enter the epoch time from now N (1-50) and number of top books X (1-50), it will return top X books with their top 5 key words in the book summaries
api: http://ec2-18-216-190-42.us-east-2.compute.amazonaws.com:5002/topKeyWords/N/X
- Redis: As a data base to store books data, every data will expire after 200 seconds in the database
- Flask: Used for buliding REST API
- scikit-learn: Used Tf-idf package for computing similarity of book titles and key words extraction for book summaries
- AWS: Used for deploy the web service
All books data comes from: https://di37ol03g7.execute-api.us-west-2.amazonaws.com/dev/