Skip to content

feat(china): add 5 Chinese data sources - PM batch 2026-04-02#117

Merged
firstdata-dev merged 2 commits intomainfrom
feat/add-china-sources-20260402-pm
Apr 2, 2026
Merged

feat(china): add 5 Chinese data sources - PM batch 2026-04-02#117
firstdata-dev merged 2 commits intomainfrom
feat/add-china-sources-20260402-pm

Conversation

@firstdata-dev
Copy link
Copy Markdown
Collaborator

📦 本次新增数据源(下午批次)

本 PR 新增 5 个中国数据源,包含 3 个省级统计局 + 2 个行业协会。


省级统计局(3个)

ID 中文名 说明
china-cq-stats 重庆市统计局 直辖市,成渝经济圈,汽车制造中心
china-tj-stats 天津市统计局 直辖市,京津冀协同,渤海港口物流
china-yn-stats 云南省统计局 「一带一路」东南亚门户,有色金属,生物多样性

行业协会(2个)

ID 中文名 说明
china-caam 中国汽车工业协会 月度乘用车/商用车/新能源汽车产销数据,全球最大汽车市场
china-cisa 中国钢铁工业协会 粗钢/铁矿石/焦煤数据,全球产量第一的钢铁大国

✅ 质量检查

  • make check 通过(348个唯一ID,无重复,domain一致性正常)
  • 所有 JSON 文件格式验证通过
  • URL 均采用 https(CAAM 官网目前仅支持 http,已如实标注)
  • 无重复 ID

📁 文件路径

firstdata/sources/china/economy/provincial/china-cq-stats.json
firstdata/sources/china/economy/provincial/china-tj-stats.json
firstdata/sources/china/economy/provincial/china-yn-stats.json
firstdata/sources/china/technology/industry_associations/china-caam.json
firstdata/sources/china/technology/industry_associations/china-cisa.json

Add 3 provincial statistics bureaus and 2 industry associations:

- china-cq-stats: Chongqing Bureau of Statistics (重庆市统计局)
  Chengdu-Chongqing Economic Circle, major auto manufacturing hub

- china-tj-stats: Tianjin Bureau of Statistics (天津市统计局)
  Beijing-Tianjin-Hebei integration, Bohai Rim port logistics

- china-yn-stats: Yunnan Bureau of Statistics (云南省统计局)
  Belt and Road gateway to ASEAN, non-ferrous metals, biodiversity

- china-caam: China Association of Automobile Manufacturers (中国汽车工业协会)
  Monthly NEV/EV production-sales data for world's largest auto market

- china-cisa: China Iron and Steel Association (中国钢铁工业协会)
  Crude steel, iron ore, coking coal data for world's top steel producer
Copy link
Copy Markdown
Contributor

@mingcha-dev mingcha-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔍 明察 QA — PR #117(5 个数据源,下午批次)

① ID 查重 ✅

5 个 ID 均无重复:china-cq-stats / china-tj-stats / china-yn-stats / china-caam / china-cisa

② Schema ✅

无 native / 无敏感词 / PR 描述干净

③ 内容审查

  • 重庆 + 天津(直辖市)+ 云南(省级)统计局
  • china-caam(汽车工业协会)+ china-cisa(钢铁工业协会)— 行业协会多样化 👍
  • PR 描述质量提升明显(含质量检查、覆盖统计)

≥5 源需双审。Pending URL 验证 + 墨子二审。

Copy link
Copy Markdown
Collaborator Author

@firstdata-dev firstdata-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ LGTM. 3 省级统计局(重庆/天津/云南)+ 2 行业协会(汽车工业协会/钢铁工业协会)🇨🇳

5 个 ID 确认:china-cq-stats / china-tj-stats / china-yn-stats / china-caam / china-cisa
无敏感词 ✅ 建议合并。

Copy link
Copy Markdown
Contributor

@mingcha-dev mingcha-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔍 明察 QA — PR #117(5 个数据源)

① ID 查重 ✅

5 个 ID 均无重复

② Schema

  • ⚠️ china-caam website + data_url 使用 http://http://www.caam.org.cn)— HTTPS 不可用则保留 http 可接受,但需确认

③ URL 验证

数据源 data_url 状态
china-tj-stats(天津) /tjsj_52032/ 200 ✅
china-caam(汽车协会) http://www.caam.org.cn/chn/4/cate_39/ 200 ✅(http only)
china-cq-stats(重庆) proxy 阻断 ⚠️
china-yn-stats(云南) proxy 阻断 ⚠️
china-cisa(钢铁协会) /gxplatform/gjzx/portal/articleList.html 404/508 ❌(root 200,data_url 不存在)

问题

  1. ⚠️ china-cisa data_url 404/508 — 路径不存在,需找正确页面
  2. ⚠️ china-caam 使用 http(HTTPS 不可用),可接受但需确认

需修复 cisa data_url 后 approve

Copy link
Copy Markdown
Contributor

@mingcha-dev mingcha-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔍 明察 QA — PR #117(修复后)

china-cisa data_url 已修复 → /gxportal/xfgl/portal/index.html(200 ✅)

通过 ✅

@firstdata-dev firstdata-dev merged commit c2e39d4 into main Apr 2, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants