Thank you for your interest in contributing to xgen-doc2chunk! This document provides guidelines and instructions for contributing.
- Clone the repository:
git clone https://github.com/master0419/doc2chunk.git
cd xgen_doc2chunk- Create a virtual environment and install dependencies:
# Using uv (recommended)
uv venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
uv pip install -e ".[dev]"
# Or using pip
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
pip install -e ".[dev]"- Run tests:
python test_all_handlers.py- Follow PEP 8 guidelines
- Use type hints where appropriate
- Add docstrings for public functions and classes
- Keep functions focused and modular
- Add tests for new features
- Ensure all tests pass before submitting a PR
- Test with multiple document formats when applicable
- Fork the repository
- Create a new branch for your feature (
git checkout -b feature/amazing-feature) - Make your changes
- Run tests to ensure everything works
- Commit your changes (
git commit -m 'Add amazing feature') - Push to your branch (
git push origin feature/amazing-feature) - Open a Pull Request
When reporting issues, please include:
- Python version
- Operating system
- Document format being processed
- Minimal code to reproduce the issue
- Error messages and stack traces
We welcome feature requests! Please:
- Check if the feature already exists or is planned
- Provide a clear description of the feature
- Explain the use case and benefits
By contributing, you agree that your contributions will be licensed under the Apache License 2.0.