Add UTF-8 encoding support for query data and update version to 2.1.0 by semantiDan · Pull Request #9 · WPSemantix/timbr_python_http

semantiDan · 2025-12-10T10:12:12Z

Add UTF-8 Encoding to Query Data

Summary

Fixed encoding issue when sending SQL queries with unicode characters to the Timbr API. Queries containing non-ASCII characters (e.g., Chinese, Arabic, emoji, accented letters) were not properly encoded, causing server-side errors.

Changes Made

1. Core Fix (pytimbr_api/timbr_http_connector.py)

Modified run_query() function to explicitly encode string queries as UTF-8 bytes
Added type checking to handle both string and pre-encoded byte inputs
Changed: data = query → data = query.encode('utf-8') if isinstance(query, str) else query

2. Test Coverage (test/test_encoding.py) - New file

Added 4 comprehensive test cases to verify encoding functionality:
- test_run_query_with_unicode_characters - Tests Chinese, French accents, and emoji
- test_run_query_with_already_encoded_bytes - Ensures backward compatibility with byte inputs
- test_run_query_with_special_sql_characters - Tests SQL special characters and copyright symbol
- test_run_query_with_multilingual_text - Tests Latin, Cyrillic, Arabic, and Japanese characters

3. Version Bump (pyproject.toml)

Updated version from 2.0.0 to 2.1.0 (minor version bump for new functionality)

4. Dependencies (requirements.txt)

Updated file encoding to UTF-8

Impact

Fixes issues with international characters and special symbols in SQL queries
Maintains backward compatibility with existing code
All tests pass successfully

Copilot

Pull request overview

This PR adds UTF-8 encoding support for SQL queries containing unicode characters (e.g., Chinese, Arabic, emoji, accented letters) to prevent server-side errors. The fix ensures queries are properly encoded before being sent to the Timbr API.

Key Changes:

Modified run_query() to explicitly encode string queries as UTF-8 bytes
Added comprehensive test coverage for unicode character handling
Bumped version from 2.0.0 to 2.1.0

Reviewed changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated 4 comments.

File	Description
pytimbr_api/timbr_http_connector.py	Added UTF-8 encoding logic with type checking for backward compatibility
test/test_encoding.py	New test file with 4 test cases covering unicode, multilingual text, and special characters
pyproject.toml	Version bump to 2.1.0 reflecting the new encoding functionality

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…yproject.toml

… and update user naming convention in tests

…n statements

… character preservation

Add UTF-8 encoding support for query data and update version to 2.1.0

a7f9179

semantiDan self-assigned this Dec 10, 2025

semantiDan requested a review from Copilot December 10, 2025 10:12

Copilot AI reviewed Dec 10, 2025

View reviewed changes

Comment thread pytimbr_api/timbr_http_connector.py

Comment thread test/test_encoding.py

Comment thread test/test_encoding.py

Comment thread test/test_encoding.py

timbr_admin added 4 commits December 10, 2025 12:50

Update Python version requirements to 3.10 in workflow, README, and p…

18525ae

…yproject.toml

Refactor user creation statements to include password in SQL commands…

087fbb9

… and update user naming convention in tests

Add TIMBR_USER_PASSWORD to test configuration and update user creatio…

ee6d93d

…n statements

Enhance UTF-8 encoding support in run_query and add tests for unicode…

04eecb3

… character preservation

semantiDan merged commit 77e0b32 into main Dec 11, 2025
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add UTF-8 encoding support for query data and update version to 2.1.0#9

Add UTF-8 encoding support for query data and update version to 2.1.0#9
semantiDan merged 5 commits into
mainfrom
feature/add-utf8-encoding-to-query-data

semantiDan commented Dec 10, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

semantiDan commented Dec 10, 2025

Add UTF-8 Encoding to Query Data

Summary

Changes Made

Impact

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants