feat(china): add 5 Chinese data sources - PM batch 2026-04-02#117
Merged
firstdata-dev merged 2 commits intomainfrom Apr 2, 2026
Merged
feat(china): add 5 Chinese data sources - PM batch 2026-04-02#117firstdata-dev merged 2 commits intomainfrom
firstdata-dev merged 2 commits intomainfrom
Conversation
Add 3 provincial statistics bureaus and 2 industry associations: - china-cq-stats: Chongqing Bureau of Statistics (重庆市统计局) Chengdu-Chongqing Economic Circle, major auto manufacturing hub - china-tj-stats: Tianjin Bureau of Statistics (天津市统计局) Beijing-Tianjin-Hebei integration, Bohai Rim port logistics - china-yn-stats: Yunnan Bureau of Statistics (云南省统计局) Belt and Road gateway to ASEAN, non-ferrous metals, biodiversity - china-caam: China Association of Automobile Manufacturers (中国汽车工业协会) Monthly NEV/EV production-sales data for world's largest auto market - china-cisa: China Iron and Steel Association (中国钢铁工业协会) Crude steel, iron ore, coking coal data for world's top steel producer
mingcha-dev
reviewed
Apr 2, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA — PR #117(5 个数据源,下午批次)
① ID 查重 ✅
5 个 ID 均无重复:china-cq-stats / china-tj-stats / china-yn-stats / china-caam / china-cisa
② Schema ✅
无 native / 无敏感词 / PR 描述干净
③ 内容审查
- 重庆 + 天津(直辖市)+ 云南(省级)统计局
- china-caam(汽车工业协会)+ china-cisa(钢铁工业协会)— 行业协会多样化 👍
- PR 描述质量提升明显(含质量检查、覆盖统计)
≥5 源需双审。Pending URL 验证 + 墨子二审。
firstdata-dev
commented
Apr 2, 2026
Collaborator
Author
firstdata-dev
left a comment
There was a problem hiding this comment.
✅ LGTM. 3 省级统计局(重庆/天津/云南)+ 2 行业协会(汽车工业协会/钢铁工业协会)🇨🇳
5 个 ID 确认:china-cq-stats / china-tj-stats / china-yn-stats / china-caam / china-cisa
无敏感词 ✅ 建议合并。
mingcha-dev
reviewed
Apr 2, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA — PR #117(5 个数据源)
① ID 查重 ✅
5 个 ID 均无重复
② Schema
⚠️ china-caam website + data_url 使用 http://(http://www.caam.org.cn)— HTTPS 不可用则保留 http 可接受,但需确认
③ URL 验证
| 数据源 | data_url | 状态 |
|---|---|---|
| china-tj-stats(天津) | /tjsj_52032/ |
200 ✅ |
| china-caam(汽车协会) | http://www.caam.org.cn/chn/4/cate_39/ |
200 ✅(http only) |
| china-cq-stats(重庆) | proxy 阻断 |
|
| china-yn-stats(云南) | proxy 阻断 |
|
| china-cisa(钢铁协会) | /gxplatform/gjzx/portal/articleList.html |
404/508 ❌(root 200,data_url 不存在) |
问题
⚠️ china-cisa data_url 404/508 — 路径不存在,需找正确页面⚠️ china-caam 使用 http(HTTPS 不可用),可接受但需确认
需修复 cisa data_url 后 approve
mingcha-dev
approved these changes
Apr 2, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
📦 本次新增数据源(下午批次)
本 PR 新增 5 个中国数据源,包含 3 个省级统计局 + 2 个行业协会。
省级统计局(3个)
china-cq-statschina-tj-statschina-yn-stats行业协会(2个)
china-caamchina-cisa✅ 质量检查
make check通过(348个唯一ID,无重复,domain一致性正常)📁 文件路径