Skip to content

Scrapy crawler to collect all Spanish words in the dictionary.

Notifications You must be signed in to change notification settings

Alex44lel/RAE-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

All Spanish Words - RAE SCRAPER

Badge Status GitHub last commit

Scrapy crawler to collect all Spanish words in the dictionary.

Run scrapy crawler

Install all requirements from requirements.txt

pip install requirements.txt

Go to scrapy/rae folder

cd scrapy/rae

Execute the crawler and save result to json

scrapy crawl raespiderwords  -o words.json
scrapy crawl raespiderdefinitions   -o words.json

Considerations

The scrappy crawler will take approximately 4 hours to get all words from RAE. There is no proxy middleware implemented on the crawler, but this has not posed any problems in previous use cases.

Aditional crawler

There is also a selenium bot that does the same as the scrapy crawler. To execute it go to selenium folder

cd selenium

Execute the crawler and save result to json

python scrapper.py

Considerations

Selenium crawler is just for demonstration purposes, do not try to obtain all words with it

By Alejandro

About

Scrapy crawler to collect all Spanish words in the dictionary.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages