A Chain-of-Thought (CoT) based abuse detection system that leverages reasoning capabilities to identify and classify abusive content with explainable decision-making.
This project implements a sophisticated abuse detection system that uses Chain-of-Thought prompting to provide transparent, step-by-step reasoning for content moderation decisions. The system can detect various forms of abuse including harassment, hate speech, cyberbullying, and toxic behavior across different platforms.
- Chain-of-Thought Reasoning: Step-by-step reasoning process for transparent decision-making
- Multi-type Abuse Detection: Supports detection of harassment, hate speech, cyberbullying, and toxicity
- Explainable AI: Provides clear explanations for each moderation decision
- Configurable Thresholds: Adjustable sensitivity levels for different use cases
- Batch Processing: Efficient processing of large content datasets
- Real-time Detection: API endpoints for live content moderation
- Performance Metrics: Comprehensive evaluation and monitoring tools
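A minimal sketch of what using such a detector might look like. The class and method names (`AbuseDetector`, `detect`, `detect_batch`) and the keyword heuristic are illustrative assumptions, not this project's actual API; the real system replaces the toy scoring with model-based CoT reasoning.

```python
from dataclasses import dataclass, field

@dataclass
class DetectionResult:
    label: str               # "abusive" or "clean"
    confidence: float        # score in [0, 1]
    reasoning: list = field(default_factory=list)  # step-by-step explanation

class AbuseDetector:
    """Hypothetical facade over the detection pipeline (illustrative only)."""

    def __init__(self, threshold: float = 0.5):
        # Configurable sensitivity threshold, as described in the features list
        self.threshold = threshold

    def detect(self, text: str) -> DetectionResult:
        # Toy keyword heuristic standing in for the real CoT-based model
        abusive_terms = {"idiot", "stupid", "hate"}
        hits = [w for w in text.lower().split() if w.strip(".,!?") in abusive_terms]
        score = min(1.0, len(hits) / 2)
        label = "abusive" if score >= self.threshold else "clean"
        reasoning = [f"matched terms: {hits}"] if hits else ["no abusive terms matched"]
        return DetectionResult(label, score, reasoning)

    def detect_batch(self, texts: list) -> list:
        # Batch processing: apply detection to each item in a dataset
        return [self.detect(t) for t in texts]
```

For example, `AbuseDetector(threshold=0.5).detect("You are an idiot")` would flag the text and attach the matched-term reasoning, while a benign message comes back labeled clean.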
CoTBasedAbuseDetection/
├── src/
│   ├── models/        # Core detection models
│   ├── cot/           # Chain-of-Thought implementation
│   ├── data/          # Data processing utilities
│   ├── utils/         # Helper functions
│   └── evaluation/    # Evaluation and metrics
├── notebooks/         # Jupyter notebooks for experimentation
├── tests/             # Unit and integration tests
├── data/              # Dataset storage
│   ├── raw/           # Raw datasets
│   ├── processed/     # Processed datasets
│   └── examples/      # Example data for testing
├── results/           # Model outputs and results
├── configs/           # Configuration files
└── scripts/           # Utility scripts
1. Install Dependencies

   pip install -r requirements.txt

2. Run Basic Detection

   python scripts/detect_abuse.py --text "Your text here"

3. Start Interactive Demo

   python scripts/demo.py
The system follows a structured reasoning process:
1. Content Analysis: Initial examination of text features
2. Context Understanding: Interpretation of implicit meanings
3. Pattern Recognition: Identification of abusive patterns
4. Severity Assessment: Evaluation of harm potential
5. Final Classification: Decision with confidence score
6. Explanation Generation: Clear reasoning for the decision
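The reasoning stages above could be assembled into a single Chain-of-Thought prompt along these lines. The stage names come from this README; the function name (`build_cot_prompt`) and the prompt wording are hypothetical, not the project's actual template.

```python
# Stage descriptions taken from the structured reasoning process above
REASONING_STAGES = [
    "Content Analysis: examine surface features of the text",
    "Context Understanding: interpret implicit meanings",
    "Pattern Recognition: identify known abusive patterns",
    "Severity Assessment: evaluate the potential for harm",
    "Final Classification: decide abusive/not abusive with a confidence score",
    "Explanation Generation: state the reasoning behind the decision",
]

def build_cot_prompt(text: str) -> str:
    """Assemble a step-by-step moderation prompt (illustrative wording)."""
    steps = "\n".join(f"{i}. {s}" for i, s in enumerate(REASONING_STAGES, 1))
    return (
        "You are a content-moderation assistant. Reason step by step:\n"
        f"{steps}\n\n"
        f"Text to classify:\n{text}\n\n"
        "Answer each step in order, then give a final label and confidence."
    )
```

Prompting the model through each stage in order is what makes the final classification explainable: the intermediate answers double as the moderation rationale.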
- Social Media Platforms: Automated content moderation
- Online Communities: Forum and comment moderation
- Educational Platforms: Safe learning environment maintenance
- Customer Support: Abuse detection in communications
- Research: Analysis of online abuse patterns
See the notebooks/ directory for detailed examples and tutorials.
Contributions are welcome! Please read our contributing guidelines and submit pull requests for any improvements.
This project is licensed under the MIT License - see the LICENSE file for details.