agarwalvishal/markdownify

Steps to run:

  1. Install packages
pip install -r requirements.txt
  2. Set values in the .env file
FIRECRAWL_API_KEY = ""
URL = "" # URL to crawl
SOURCE_LIBRARY = "" # Name of the library being crawled (optional)
  3. Crawl and save the data
python crawl_and_save.py
  4. Process the saved data to markdown
python process.py
  5. The output is available inside the markdown_docs folder.
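
The three .env values from step 2 can be validated before any crawl starts, so a missing key fails fast instead of partway through a run. The following is a minimal sketch, not the repository's actual code: it assumes the values have already been loaded into the process environment (for example by a .env loader such as python-dotenv, which is an assumption about this project), and the names `Settings` and `load_settings` are hypothetical.

```python
import os
from typing import Mapping, NamedTuple, Optional


class Settings(NamedTuple):
    """Hypothetical container for the three values described in step 2."""
    api_key: str
    url: str
    source_library: Optional[str]


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the settings from a mapping and fail early if a required one is empty."""
    api_key = env.get("FIRECRAWL_API_KEY", "").strip()
    url = env.get("URL", "").strip()
    if not api_key:
        raise ValueError("FIRECRAWL_API_KEY is not set")
    if not url:
        raise ValueError("URL is not set")
    # SOURCE_LIBRARY is optional, so an empty value becomes None.
    source_library = env.get("SOURCE_LIBRARY", "").strip() or None
    return Settings(api_key, url, source_library)
```

Passing a plain dictionary instead of `os.environ` makes the function easy to try out without touching the real environment.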

Note

Crawling and processing are split into two scripts, crawl_and_save.py and process.py. The raw crawl data is saved first, so if processing fails you can rerun process.py on the saved data without crawling again and spending additional credits.
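
The two-stage pattern described in this note can be sketched as follows. This is an illustration under stated assumptions, not the contents of crawl_and_save.py or process.py: the crawl is represented by a hypothetical `fetch_pages` callable (standing in for whatever Firecrawl call the real script makes), each page is assumed to be a dictionary with `url` and `markdown` keys, and the raw file path and output file names are made up for the example.

```python
import json
from pathlib import Path
from typing import Callable, Dict, List

Page = Dict[str, str]  # assumed shape: {"url": ..., "markdown": ...}


def crawl_and_save(fetch_pages: Callable[[], List[Page]], raw_path: Path) -> None:
    """Stage 1: run the (paid) crawl once and persist the raw result to disk."""
    pages = fetch_pages()
    raw_path.parent.mkdir(parents=True, exist_ok=True)
    raw_path.write_text(json.dumps(pages, ensure_ascii=False, indent=2), encoding="utf-8")


def process(raw_path: Path, out_dir: Path) -> List[Path]:
    """Stage 2: read the saved raw data and write one markdown file per page.

    This stage never calls the crawler, so it can be rerun after a failure
    without spending more credits.
    """
    pages = json.loads(raw_path.read_text(encoding="utf-8"))
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for index, page in enumerate(pages):
        target = out_dir / f"page_{index:03d}.md"
        body = f"<!-- source: {page['url']} -->\n\n{page['markdown']}\n"
        target.write_text(body, encoding="utf-8")
        written.append(target)
    return written
```

Because `process` depends only on the saved file, a bug in the processing step can be fixed and the step repeated as many times as needed while the crawl itself runs exactly once.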

About

Convert websites to LLM-ready markdown format
