
Steps to run:

1. Install packages:

   ```
   pip install -r requirements.txt
   ```

2. Set values in the `.env` file:

   ```
   FIRECRAWL_API_KEY=""   # your API key here
   URL=""                 # URL to crawl
   LIMIT=175              # number of pages to crawl
   SOURCE_LIBRARY=""      # name of the library being crawled (optional)
   ```

3. Crawl and save the data:

   ```
   python crawl_and_save.py
   ```

4. Process the saved data to markdown:

   ```
   python process.py
   ```

5. The output is available inside the `markdown_docs` folder.
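As a rough illustration of step 2, here is a minimal sketch of how the `.env` values above could be parsed. This is a hypothetical stand-in, assuming simple `KEY = "value"` lines with optional `#` comments; the actual scripts may use a library such as python-dotenv instead.

```python
import os

# Hypothetical sketch: parse simple KEY = "value" lines from a .env file.
# The real crawl_and_save.py / process.py may load configuration differently.
def load_env(path=".env"):
    """Return a dict of settings, ignoring blank lines and # comments."""
    values = {}
    with open(path) as f:
        for line in f:
            line = line.split("#", 1)[0].strip()  # drop inline comments
            if not line or "=" not in line:
                continue
            key, _, raw = line.partition("=")
            values[key.strip()] = raw.strip().strip('"')
    return values

if __name__ == "__main__" and os.path.exists(".env"):
    cfg = load_env()
    limit = int(cfg.get("LIMIT", "100"))  # e.g. LIMIT=175 caps pages crawled
```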

Note

The workflow is split into two scripts, `crawl_and_save.py` and `process.py`. The first crawls the site and saves the raw data; the second converts that saved data to markdown. This way, if processing fails, you can re-run `process.py` without crawling again and spending unnecessary credits.
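The two-stage split described above can be sketched as follows. File names and the shape of the crawl results are assumptions for illustration only, not the actual interface of the repository's scripts.

```python
import json
from pathlib import Path

# Hypothetical two-stage pipeline mirroring the note above.
RAW_FILE = Path("raw_crawl_data.json")   # assumed intermediate file name
OUT_DIR = Path("markdown_docs")

def save_raw(pages):
    """Stage 1 (crawl_and_save.py): persist raw crawl results to disk once."""
    RAW_FILE.write_text(json.dumps(pages))

def process_to_markdown():
    """Stage 2 (process.py): safe to re-run on failure without re-crawling."""
    pages = json.loads(RAW_FILE.read_text())
    OUT_DIR.mkdir(exist_ok=True)
    for i, page in enumerate(pages):
        # Assumes each crawled page carries a "markdown" field.
        (OUT_DIR / f"page_{i}.md").write_text(page["markdown"])
```

Because stage 2 reads only from the saved file, a processing failure costs nothing extra: fix the bug and re-run `process.py`.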