InferEdge

Multi-repository entrypoint for the InferEdge local-first Edge AI inference validation pipeline.

InferEdge is not a benchmark-only script and not a production SaaS dashboard. It is a contract/preset based validation workflow that connects build provenance, real Runtime execution, validation evidence, optional deterministic diagnosis evidence, and Lab-owned deployment decisions.

The ecosystem is organized by lifecycle questions:

Can we deploy this model?                         -> InferEdge validation layer
Can this benchmark evidence be trusted and compared? -> InferEdgeEnv comparability layer
Can deployed workloads stay stable under load?   -> InferEdgeOrchestrator operation layer

ONNX Model
-> InferEdgeForge
-> InferEdge-Runtime
-> InferEdgeLab
-> optional InferEdgeAIGuard
-> Deployment Decision Report
-> Local Studio

Repositories

Repository	Role	URL
InferEdgeForge	Build provenance, metadata, manifest, artifact handoff	https://github.com/gwonxhj/InferEdgeForge
InferEdge-Runtime	C++ execution, Lab-compatible result JSON, Jetson evidence reports	https://github.com/gwonxhj/InferEdge-Runtime
InferEdgeLab	Compare/evaluate/report/API/Local Studio/deployment decision owner	https://github.com/gwonxhj/InferEdgeLab
InferEdgeAIGuard	Optional deterministic diagnosis evidence provider	https://github.com/gwonxhj/InferEdgeAIGuard

Ecosystem Extension Layers

These repositories extend the lifecycle beyond the pinned Core 4 validation message without replacing Forge, Runtime, Lab, or AIGuard.

Repository	Role	URL
InferEdgeEnv	v0.1.5 v1-complete comparability layer: local run evidence registry and benchmark evidence trust/comparison judgement	https://github.com/gwonxhj/InferEdgeEnv
InferEdgeOrchestrator	Operation layer after deployment validation: scheduling, overload control, and runtime telemetry	https://github.com/gwonxhj/InferEdgeOrchestrator

Real-Device Evidence

Jetson Orin Nano Internal Lab provides hardware-level runtime evidence, including TensorRT, ONNX Runtime, YOLO, Whisper, FastAPI serving, telemetry logs, sustained multi-workload interaction evidence, and InferEdge-compatible handoff artifacts.

For the submission-ready diagram and layer split, start with InferEdge Ecosystem 1-Page Summary.

Quick Start

Clone this entrypoint repo first:

git clone https://github.com/gwonxhj/InferEdge.git
cd InferEdge

Clone all pipeline repositories:

bash scripts/clone_all.sh --locked

This creates:

repos/
├─ InferEdgeForge
├─ InferEdge-Runtime
├─ InferEdgeLab
└─ InferEdgeAIGuard

Run the portfolio smoke checks:

bash scripts/smoke_all.sh

Run the Reliable Edge Agent Runtime extension smoke when the supporting Orchestrator repo is available in the same workspace:

bash scripts/demo_agent_runtime_e2e.sh

# Optional: run the explicit device_local starter path.
bash scripts/demo_agent_runtime_e2e.sh --device-local

# Optional: replace the device-local starter fixtures with local inputs.
bash scripts/demo_agent_runtime_e2e.sh --device-local \
  --vision-input ../InferEdgeOrchestrator/examples/inputs/vision_frame.ppm \
  --voice-ingress-payload ../InferEdgeOrchestrator/examples/inputs/voice_ingress_requests.json \
  --capture-process-resource-snapshot

# Optional: add a lightweight ONNX Runtime probe to the Vision producer.
bash scripts/demo_agent_runtime_e2e.sh --device-local \
  --vision-input ../InferEdgeOrchestrator/examples/inputs/vision_frame.ppm \
  --vision-onnx-model /path/to/vision_model.onnx

# Optional: generate a tiny detector-like ONNX probe model locally.
bash scripts/demo_agent_runtime_e2e.sh --device-local \
  --vision-input ../InferEdgeOrchestrator/examples/inputs/vision_frame.ppm \
  --generate-vision-detector-probe

# Optional: route a captured Jetson tegrastats log through the same timeline.
bash scripts/demo_agent_runtime_e2e.sh --device-local \
  --tegrastats-log /path/to/tegrastats.log

# Optional: capture Jetson tegrastats during the Orchestrator sustained run.
bash scripts/demo_agent_runtime_e2e.sh --device-local \
  --vision-input ../InferEdgeOrchestrator/examples/inputs/vision_frame.ppm \
  --vision-onnx-model /path/to/vision_model.onnx \
  --capture-tegrastats

# Optional: also replay the file-based remote worker selection starter.
bash scripts/demo_agent_runtime_e2e.sh --remote-dispatch

This reproduces the file-based chain from agent_manifest to Runtime result.agent, Orchestrator scheduling evidence, AIGuard runtime reliability analysis, and the Lab-owned Agent Runtime Reliability report. The entrypoint smoke now also verifies that Lab preserves Orchestrator operation context, including queue state, worker health, runtime event summary, and timeline samples in JSON/Markdown reports. With --remote-dispatch, the same script also writes Orchestrator's file-based remote worker selection result. This is a remote dispatch starter contract with worker selection, retry/fallback planning, and plan-only execution metadata, not production SSH/HTTP remote execution. The current extension smoke uses the latest Orchestrator producer-backed sustained path: Vision reads a local image fixture, Voice-Command replays a FastAPI-style request burst fixture, and Safety-Monitor reads resource snapshot telemetry. It checks queue-depth, policy decision reason, multi_workload_sustained_summary, producer source markers, optional tegrastats_timeline, AIGuard profiled_workload_pressure / thermal_resource_pressure, and Lab sustained_overload_review evidence before live device-local sustained validation is added. Use --device-local to replay the committed local image, request, and resource snapshot producers in Orchestrator scenario_mode=device_local. For local device experiments, keep --device-local and pass --vision-input, optional --vision-onnx-model, --voice-ingress-payload, and either --resource-snapshot or --capture-process-resource-snapshot to reuse the same entrypoint script with runtime input overrides. The ONNX option records provider, input/output shapes, and probe latency as lightweight Vision producer evidence; --tegrastats-log can carry a captured Jetson/resource log through the Orchestrator tegrastats_timeline; --capture-tegrastats captures Jetson telemetry during the Orchestrator run when the tegrastats command is available. These options do not claim a full live YOLO/Whisper/FastAPI sustained service. See docs/agent_runtime_e2e_demo.md for the minimum committed sample paths and a resource-snapshot variant. Use --generate-vision-detector-probe when you want a reproducible detector-like ONNX probe without committing a model artifact.

Recent Jetson starter validation:

Evidence	Value
Device mode	Jetson Orin Nano 25W
Scenario	`device_local` starter with live `tegrastats` log
Frames	64
Max queue depth	6
Dropped / fallback count	61 / 61
Deadline misses	0
Parsed `tegrastats` samples	4
Max temperature	39.625 C
Max RAM used	1783 MB
Lab decision	`blocked` from runtime reliability review rules

This record proves the starter path can carry live Jetson resource telemetry through Orchestrator, AIGuard, and Lab reports. It is still a device-local starter smoke, not a claim of full live YOLO/Whisper/FastAPI sustained validation.

Recent Jetson ONNX probe validation:

Evidence	Value
Device mode	Jetson Orin Nano 25W
Scenario	`device_local` starter with Vision ONNX Runtime probe
Frames	16
Vision probe backend	`onnxruntime` / `CPUExecutionProvider`
Vision probe output shape	`[1, 2]`
Vision probe latency	1.255 ms
Max queue depth	6
Dropped / fallback count	13 / 13
Deadline misses	1
AIGuard verdict	`blocked` / `high`
Lab decision	`blocked` from runtime reliability review rules

This second record validates that the entrypoint can pass a local ONNX model into the Jetson device-local Vision producer and preserve ONNX Runtime probe evidence through Orchestrator, AIGuard, and Lab reports. The probe used a tiny identity ONNX model, so it should be described as device-local ONNX probe evidence rather than full live YOLO validation.

Recent captured tegrastats handoff validation:

Evidence	Value
Device mode	Jetson Orin Nano 25W
Scenario	`device_local` starter with `--tegrastats-log`
Captured log duration	~12 seconds
Parsed `tegrastats` samples	11
Frames	16
Vision probe backend	`onnxruntime` / `CPUExecutionProvider`
Max queue depth	6
Dropped / fallback count	13 / 13
Deadline misses	1
Max temperature	41.5 C
Max RAM used	830 MB
AIGuard verdict	`blocked` / `high`
Lab decision	`blocked` from runtime reliability review rules

This record validates the new --tegrastats-log entrypoint option with a captured Jetson log. It is telemetry handoff evidence, not a full thermal endurance or live workload validation.

Recent generated detector probe validation:

Evidence	Value
Scenario	local `device_local` starter with generated detector-like ONNX probe
Generated model	`generated_models/detector_tiny.onnx`
Vision probe backend	`onnxruntime` / `CPUExecutionProvider`
Vision input shape	`[1, 3, 16, 16]`
Vision output shape	`[1, 6]`
Frames	8
Max queue depth	6
Dropped / fallback count	5 / 5
Lab decision	`blocked` from runtime reliability review rules

This record validates a reproducible detector-like ONNX probe generated at run time. It is closer to image-shaped perception work than the identity probe, but it is still not full live YOLO validation.

Recent Jetson generated detector probe smoke:

Evidence	Value
Device	Jetson Orin Nano
Scenario	`device_local` starter with generated detector-like ONNX probe
Vision probe backend	`onnxruntime` / `CPUExecutionProvider`
Vision input shape	`[1, 3, 16, 16]`
Vision output shape	`[1, 6]`
Frames	16
Max queue depth	6
Dropped / fallback count	13 / 13
Deadline missed count	1
AIGuard verdict	`blocked` / `high`
Lab decision	`blocked` from runtime reliability review rules

This record validates the same entrypoint chain on Jetson using a generated detector-like probe. It is device-local smoke evidence, not full live YOLO or thermal endurance validation.

Recent Jetson YOLOv8n ONNX probe smoke:

Evidence	Value
Device	Jetson Orin Nano
Model	user-provided `yolov8n.onnx`
Scenario	`device_local` starter with real ONNX model probe
Vision probe backend	`onnxruntime` / `CPUExecutionProvider`
Vision input shape	`[1, 3, 640, 640]`
Vision output shape	`[1, 84, 8400]`
Frames	16
Max queue depth	6
Dropped / fallback count	13 / 13
Deadline missed count	10
Vision probe elapsed range	`120.147-146.878 ms`
AIGuard verdict	`blocked` / `high`
Lab decision	`blocked` from runtime reliability review rules

This record validates the entrypoint chain with a real YOLOv8n ONNX model on Jetson. It is still an ONNX Runtime probe inside the device-local orchestration smoke, not a full live camera or decoded detection validation.

Recent Jetson YOLOv8n ONNX probe with live tegrastats capture:

Evidence	Value
Device	Jetson Orin Nano
Model	user-provided `yolov8n.onnx`
Scenario	`device_local` starter with `--capture-tegrastats`
Vision probe backend	`onnxruntime` / `CPUExecutionProvider`
Vision input/output shape	`[1, 3, 640, 640]` -> `[1, 84, 8400]`
Frames	32
Max queue depth	6
Dropped / fallback count	29 / 29
Deadline missed count	18
Parsed `tegrastats` samples	4
Max temperature / RAM	43.937 C / 966 MB
Vision probe elapsed range	`119.912-137.729 ms`
AIGuard verdict	`blocked` / `high`
Lab decision	`blocked` from runtime reliability review rules

This record ties a real YOLOv8n ONNX probe and live Jetson telemetry capture into the same entrypoint evidence chain. It remains a device-local smoke, not thermal endurance or live camera validation.

Latest Jetson device-local replay:

Evidence	Value
Device	Jetson Orin Nano 25W
Model	user-provided `yolov8n.onnx`
Scenario	`device_local` starter with real ONNX model + live `tegrastats`
Frames	24
Max queue depth	6
Dropped / fallback count	21 / 21
Deadline missed count	14
Parsed `tegrastats` samples	3
Max temperature / RAM	40.656 C / 967 MB
Vision mean / p95 latency	172.593 ms / 437.55 ms
AIGuard verdict	`blocked` / `high`
Lab decision	`blocked` from runtime reliability review rules

This replay was run from the entrypoint script with --device-local, --vision-onnx-model, --capture-process-resource-snapshot, and --capture-tegrastats. It confirms the latest main branch still carries real Jetson ONNX probe and live telemetry evidence through Orchestrator, AIGuard, and Lab. It is not a decoded YOLO accuracy validation or sustained thermal endurance claim.

Open the Local Studio demo:

cd repos/InferEdgeLab
poetry run inferedgelab serve --host 127.0.0.1 --port 8000

Then open:

http://127.0.0.1:8000/studio

Click Load Demo Evidence to replay the bundled ONNX Runtime CPU vs TensorRT Jetson evidence, compare view, deployment decision context, and optional AIGuard diagnosis cases.

Reproducible Clone Modes

Use the verified lock file:

bash scripts/clone_all.sh --locked

Use the latest main branch from each repo:

bash scripts/clone_all.sh --latest

Update existing clones:

bash scripts/update_all.sh --latest

or return to the locked portfolio snapshot:

bash scripts/update_all.sh --locked

Entrypoint Files

File	Purpose
`repos.yaml`	Human-readable repository map
`repos.lock`	Verified commit snapshot used by `--locked`
`scripts/clone_all.sh`	Clone all InferEdge Core repositories
`scripts/update_all.sh`	Update existing clones to latest or locked state
`scripts/smoke_all.sh`	Run cross-repo portfolio smoke checks
`scripts/demo_agent_runtime_e2e.sh`	Replay the Reliable Edge Agent Runtime extension smoke
`docs/ecosystem_1page.md`	Submission-ready ecosystem diagram and three-question layer map
`docs/assets/inferedge_ecosystem_diagram.svg`	Reusable ecosystem diagram asset for README, portfolio pages, and slides
`docs/agent_runtime_e2e_demo.md`	Agent runtime contract-chain demo guide
`docs/portfolio_summary.md`	30-second portfolio summary and one-line repository role map
`docs/interview_narrative.md`	Interview-ready narrative for explaining the ecosystem and Jetson evidence role
`docs/final_submission_rehearsal.md`	Clean-clone submission gate rehearsal and results
`docs/pipeline_map.md`	Pipeline map and repository responsibility guide

Current Demo Evidence

The canonical Local Studio demo evidence is maintained in InferEdgeLab and InferEdge-Runtime:

Evidence	Value
TensorRT Jetson FP16 25W mean	10.066401 ms
TensorRT Jetson FP16 25W p99	15.548438 ms
TensorRT Jetson FP16 25W FPS	99.340373
ONNX Runtime CPU mean	45.4299 ms
ONNX Runtime CPU p99	49.2128 ms
ONNX Runtime CPU FPS	22.0119
Local Studio speedup	about 4.51x
YOLOv8 subset	10 images / 89 ground-truth boxes
simplified mAP@50	0.1410
precision / recall	0.2941 / 0.1685

Scope Boundary

Included:

local-first validation workflow
repository clone/update entrypoint
Core 4 contract smoke orchestration
Local Studio demo entrypoint
README and documentation map

Not included:

production SaaS infrastructure
DB/Redis queue persistence
auth/billing/upload flow
cloud dashboard deployment
automatic evaluation for arbitrary model families

Primary Review Path

For a reviewer or interviewer, start here:

docs/portfolio_summary.md
docs/interview_narrative.md
docs/final_submission_rehearsal.md
This README
docs/pipeline_map.md
repos/InferEdgeLab/README.md
repos/InferEdgeLab/docs/portfolio/inferedge_portfolio_submission.md
repos/InferEdge-Runtime/docs/reports/jetson_evidence_summary.md
repos/InferEdgeAIGuard/docs/detector_validation_matrix.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

InferEdge

Repositories

Ecosystem Extension Layers

Real-Device Evidence

Quick Start

Reproducible Clone Modes

Entrypoint Files

Current Demo Evidence

Scope Boundary

Primary Review Path

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
docs		docs
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
repos.lock		repos.lock
repos.yaml		repos.yaml

Folders and files

Latest commit

History

Repository files navigation

InferEdge

Repositories

Ecosystem Extension Layers

Real-Device Evidence

Quick Start

Reproducible Clone Modes

Entrypoint Files

Current Demo Evidence

Scope Boundary

Primary Review Path

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages