#119 Add synthetic Data generator for request_comments and volunteer_rating#133
Merged
Nishu2000-hub merged 2 commits intomainfrom Apr 17, 2026
Merged
Conversation
Contributor
Author
|
After submission, discovered the official help_categories table schema
|
- Added 3-tier text generation: compositional grammar, stochastic perturbation, and diversity validator for ML-training-grade diversity - Added standalone usage mode (--rows flag) - Regenerated request_comments.csv and volunteer_rating.csv with higher-quality diverse data Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Nishu2000-hub
added a commit
that referenced
this pull request
Apr 17, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Synthetic data generator for two tables: request_comments and volunteer_rating.
What's included
database/mock-data-generation/generate_119.py— the generator scriptdatabase/mock-data-generation/README.md— documentationdatabase/mock_db/request_comments.csv— 100 rowsdatabase/mock_db/volunteer_rating.csv— 100 rowsHow to run