Skip to content

C0oki3s/ScribdT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ScribdT - Focused on Extracting Data from Scribd

Overview

ScribdT is a powerful tool designed to search for documents or users on Scribd and extract specific data based on given queries. It provides functionalities to perform user brute force searches and retrieve user data from a SQLite database.

Table of Contents

  1. Installation
  2. Usage
  3. Subcommands
  4. Upgrading the Package

Installation

To install ScribdT, you need to clone the repository and install the package using setup.py.

git clone <repository-url>
cd ScribdT
pip install .
git clone <repository-url>
cd ScribdT
py -3.12 -m venv venv
pip install .
python -m spacy download en_core_web_lg

Usage

To use ScribdT, run the ScribdTool command with the appropriate subcommand and options. The general usage format is:

ScribdT [-h] {documents,users,r} ...

Subcommands

usage: app.py [-h] {documents,users,r} ...

Search Scribd for documents or users.
options:
  -h, --help           show this help message and exit

subcommands:
  {documents,users,r}
    documents          Search for documents
    users              Search for users
    r                  Retrieve data from SQLite database

Querying Documents

To search for documents on Scribd, use the documents subcommand. The general format is:

Example

ScribdT documents query="passport" --filters filters.json --cookies cookies.txt

This command searches Scribd for documents related to "passport" and uses the provided filters and cookies.

filters is a optional Tag as it uses to detect parameters for presidio-analyzer.

--filters filters.json:

  • Optional argument. Provides additional filtering rules via a JSON file.

  • These filters can refine the search using custom logic.

  • Often used to integrate with presidio-analyzer for detecting sensitive entities like PII (Personally Identifiable Information).

The filters.json file might contain parameters like:

{
"entities": ["PERSON", "PHONE_NUMBER", "EMAIL_ADDRESS"]
}

--cookies cookies.txt:

  • Supplies a cookies file used to authenticate with Scribd.

  • _scribd_session=yourCookies It can be anywhere as we have --cookies flag

User Bruteforce

To search for users on Scribd, use the users subcommand. The general format is:

Example

ScribdT users -ue=500 --db scribd_users.db

This command performs a brute force search for up to 500 users on Scribd and saves the results to scribd_users.db.

Retrieve Saved Data

To retrieve data from a SQLite database, use the r subcommand. ScribdT supports retrieving users and searching by username.

ScribdT r -u --db DATABASE_FILE

This command retrieves all users from the specified SQLite database file.

Retrieve by Username

ScribdT r --username=johndoe --db scribd_users.db

This command retrieves information about the user with the username "johndoe" from the specified SQLite database file.

Upgrading the Package

pip install --upgrade .

If you appreciate my efforts, I would greatly appreciate your support. Your encouragement keeps me motivated to explore more in cybersecurity and continue developing open-source tools. Thank you for your support!

Buy Me A Coffee

About

ScribdT - Focused on Extracting Data from Scribd

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Contributors

Languages