A fault-tolerant microservices pipeline designed to process system logs and detect anomalies in real-time without data loss during traffic spikes.
This system decouples log generation from processing using a message queue, ensuring high availability and backpressure management. It consists of three core microservices containerized with Docker:
- Producer: Generates simulated system logs at varying volumes to mimic real-world traffic.
- Ingestor: Acts as the API gateway, receiving logs from the producer and pushing them securely into an AWS SQS queue.
- Analyzer: A background worker that continuously polls AWS SQS for new logs. It runs the data through an Isolation Forest (Machine Learning) model to autonomously detect system errors and anomalies, bypassing the need for manual, static monitoring rules.
- Languages & Frameworks: Python
- Infrastructure: Docker, Docker Compose
- Cloud (AWS): SQS (Message Queuing), DynamoDB (Storage)
- Machine Learning: Scikit-Learn (Isolation Forest)
- Clone the repository:
git clone [https://github.com/SriHarsha-23/distributed-log-analyzer.git](https://github.com/SriHarsha-23/distributed-log-analyzer.git)