An automated safety filter testing system for defensive security research and AI safety evaluation.
This tool is designed for defensive security research to test AI safety filters and improve content moderation systems. It helps identify potential vulnerabilities in AI systems to strengthen their defenses. kimi helps to create Prompts which later is given to ChatGPT models for Image Gen. The prompts are saved in logs with the output image.
- Python 3.8+
- API keys for OpenAI and KIMI( You can use any other model by tweaking the code)
# Clone the repository
git clone https://github.com/yourusername/automator-redteam.git
cd automator-redteam
# Install dependencies
pip install -r requirements.txt
# Set up environment variables
cp .env.example .env
# Edit .env with your API keys# Interactive mode (beautiful menu)
python3 arth.py
# Automated testing
python3 arth.py --mode auto --iterations 20 --theme "safety testing"
# Generate reports
python3 arth.py --mode reportautomator-redteam/
├── arth.py # Main red team terminal
├── requirements.txt # Python dependencies
├── .gitignore # Git ignore rules
├── README.md # This file
├── images/ # Generated test images
├── logs/ # JSON test logs
└── reports/ # Analytics reports
Create a .env file with your API keys:
OPENAI_API_KEY=your_openai_key_here
kimi_API_KEY=your_kimi_key_here
- Multi-API Integration: kimi API and OpenAI GPT Image 1
- Safety Testing: Automated content moderation testing
- Comprehensive Logging: JSON format for security analysis
- Visual Reports: Generated charts and analytics
- Interactive Interface: Terminal-based menu system
All test results are saved in:
logs/- JSON logs with test detailsimages/- Generated test imagesreports/- Analytics and summaries
This tool is intended for:
- Security researchers
- AI safety teams
- Defensive testing purposes
- Academic research
Important: Always follow responsible disclosure practices when identifying vulnerabilities.
- Fork the repository
- Create a feature branch
- Submit a pull request with clear description
- Ensure all tests pass
This project is released for educational and defensive security research purposes.