docs: add troubleshooting guide for common plugin issues#1260
docs: add troubleshooting guide for common plugin issues#1260S-anskriti wants to merge 2 commits intojenkinsci:mainfrom
Conversation
Add docs/troubleshooting.md covering: - OTLP endpoint connection failures and port configuration - Authentication errors (Bearer tokens, OTLP headers) - Known workaround for metrics stopping after config change (jenkinsci#424) - Missing ci.pipeline.* metrics and Collector configuration (jenkinsci#930) - How to enable FINE log level for debugging Add Troubleshooting section to README.md linking to the new guide.
|
Hi @shivaylamba and @krisstern, I'm a GSoC 2026 applicant interested in the "Use OpenTelemetry |
|
|
||
| **Symptom:** No traces or metrics appear in your backend. The Jenkins system log shows: | ||
| **Checklist:** | ||
|
|
There was a problem hiding this comment.
the first thing you have to do is check the logs, Jenkins, Collector, and backend
There was a problem hiding this comment.
Great point! I'll add log checking as the first step in the checklist
before the other items. Thanks for the review!
| **Symptom:** The Jenkins log shows `UNAUTHENTICATED` or `PERMISSION_DENIED`. | ||
|
|
||
| **Checklist:** | ||
|
|
There was a problem hiding this comment.
Understood, will add log checking as step 1 here as well.
docs/troubleshooting.md
Outdated
| ## Metrics stop appearing after a configuration change | ||
|
|
||
| **Symptom:** Traces and metrics were working but stopped after updating the | ||
| plugin configuration. | ||
|
|
||
| **Known workaround:** Restart the Jenkins controller after changing | ||
| OpenTelemetry settings. The exporter pipeline does not always reinitialise | ||
| without a restart. | ||
|
|
||
| Related: [Issue #424](https://github.com/jenkinsci/opentelemetry-plugin/issues/424) | ||
|
|
There was a problem hiding this comment.
This is resolved and the "issue" descrition is vage, I am not sure if it worth to mention it.
There was a problem hiding this comment.
Thanks for the clarification! I'll remove this section entirely.
Since the issue has been resolved. I didn't want to reference outdated
workarounds.
| ## Missing ci.pipeline.* metrics | ||
|
|
||
| **Symptom:** `jenkins.*` metrics arrive correctly but `ci.pipeline.*` metrics | ||
| are absent in your backend. | ||
|
|
||
| **Checklist:** | ||
|
|
||
| 1. Ensure your backend supports OTLP metrics. Prometheus requires the | ||
| OpenTelemetry Collector with a Prometheus exporter. | ||
| 2. Enable `resource_to_telemetry_conversion` on the Collector: | ||
| ```yaml | ||
| exporters: | ||
| prometheus: | ||
| endpoint: "0.0.0.0:1234" | ||
| resource_to_telemetry_conversion: | ||
| enabled: true | ||
| ``` | ||
| 3. Verify at least one Pipeline build has completed since the plugin was installed. | ||
|
|
||
| Related: [Issue #930](https://github.com/jenkinsci/opentelemetry-plugin/issues/930) |
There was a problem hiding this comment.
no clear what happens and why
There was a problem hiding this comment.
You're right, the symptom description is unclear. I'll rewrite it to better explain WHY ci.pipeline.* metrics go missing and what the root cause is. Could you point me to the right context so I can write this accurately?
- Add log checking as first troubleshooting step in OTLP and auth sections - Remove resolved issue jenkinsci#424 reference - Clarify ci.pipeline.* symptom with root cause explanation - Explain why resource_to_telemetry_conversion is needed
|
@kuisathaverat I've pushed an update addressing all your feedback:
Please let me know if anything else needs adjusting! |
|
@kuisathaverat the CI failures appear to be pre-existing and |
| 5. If using the environment variable approach, verify the format: | ||
|
|
||
| OTEL_EXPORTER_OTLP_HEADERS=Authorization=Bearer YOUR_TOKEN |
There was a problem hiding this comment.
the authorization is managed by the plugin, this environment variables are used in rare cases
Summary
Adds a new
docs/troubleshooting.mdfile and links it fromREADME.md.Motivation
Users regularly open issues describing problems that have known
solutions — endpoint failures, auth errors, missing metrics. A
dedicated troubleshooting doc reduces repeated support requests
and helps new users self-diagnose without needing to file an issue.
Changes
docs/troubleshooting.mdwith sections on:ci.pipeline.*metrics (references Opentelemtry Jenkins metricsci.pipeline.*are missing #930)FINElog level for debuggingREADME.mdlinking to the new guideTesting
Documentation only — no functional changes.
Note: I am a GSoC 2026 applicant interested in the
Use OpenTelemetry for Jenkins Jobs on ci.jenkins.io
project. This contribution is part of my pre-application community engagement.