robotika · m3d · Apr 13, 2026 · Apr 1, 2026 · Apr 2, 2026 · Apr 2, 2026
diff --git a/README.md b/README.md
@@ -39,6 +39,7 @@ OSGAR-based robots, including the "Matty twins," have successfully competed in t
 
 For more detailed information, please refer to:
 *   [Technical Guide (English)](https://robotika.github.io/osgar/index.html)
+*   [Deep Dive for OSGAR Developers](doc/deep_dive.md)
 *   [Czech Guide (Průvodce OSGARem)](https://robotika.cz/guide/osgar/cs)
 
 # Architecture

diff --git a/doc/deep_dive.md b/doc/deep_dive.md
@@ -0,0 +1,257 @@
+# Deep Dive for OSGAR Developers
+
+This document explains the internal workings of the OSGAR system, specifically focusing on how modules are initialized, how they communicate, and how data is recorded.
+
+## Architecture Overview
+
+OSGAR uses a log-centric, hub-and-spoke architecture where all communication passes through a central Bus and is immediately recorded.
+
+```mermaid
+graph TD
+    subgraph "OSGAR Runtime"
+        BUS(Central Bus / LogWriter)
+        IO[I/O Driver<br/>e.g. LogSerial]
+        PROC[Processing Module<br/>e.g. GPS]
+        APP[Application Logic]
+    end
+
+    HW((Hardware)) <-->|Raw Bytes| IO
+    IO -->|publish 'raw'| BUS
+    BUS -.->|listen 'raw'| PROC
+    PROC -->|publish 'position'| BUS
+    BUS -.->|listen 'position'| APP
+    APP -->|publish 'desired_speed'| BUS
+    BUS -.->|listen| IO
+
+    BUS ===> LOG[(.log file)]
+
+    classDef module fill:#f9f,stroke:#333,stroke-width:1px;
+    class IO,PROC,APP module;
+    style BUS fill:#bbf,stroke:#333,stroke-width:2px;
+    style LOG fill:#fff,stroke:#333,stroke-dasharray: 5 5;
+```
+
+This architecture ensures that the state of the entire system is captured in the log file, enabling perfect replay and deterministic simulation.
+
+## 1. System Starting Modules (`osgar.record`)
+
+The entry point for recording data in OSGAR is `osgar.record`. When you run `python -m osgar.record config.json`, the following sequence occurs:
+
+1.  **Configuration Loading**: The system loads the JSON configuration file using `osgar.lib.config.config_load`. This merges any included configurations and handles parameter overrides.
+2.  **LogWriter Initialization**: A `LogWriter` is created to handle the output log file. The entire configuration is serialized and written as the first record (stream 0) in the log.
+3.  **Recorder Creation**: The `Recorder` class is instantiated with the configuration and the logger.
+4.  **Bus and Handlers**: The `Recorder` creates a central `Bus` object. For each module defined in the configuration, a `_BusHandler` is created. This handler is the module's interface to the rest of the system.
+5.  **Module Instantiation**: For each module, the driver class is located (using `get_class_by_name`) and instantiated. The module is passed its specific `init` configuration and its dedicated `bus` handle.
+6.  **Connecting Modules**: After all modules are created, the `Bus.connect` method is called for each link in the configuration to establish communication paths.
+7.  **Starting Threads**: Finally, `module.start()` is called for each module, which, for `Node`-based modules, starts a new Python thread.
+
+## 2. Module Parameters in Config JSON
+
+The configuration file defines the structure of the OSGAR application. Each module entry typically contains:
+
+-   `driver`: The Python class name or alias (e.g., `osgar.drivers.gps:GPS`).
+-   `init`: A dictionary of parameters passed to the module's `__init__` method.
+-   `in`: (Optional) List of input channel names.
+-   `out`: (Optional) List of output channel names.
+
+**Note on `in` and `out`**: While these keys are present in many OSGAR configurations, they are primarily used for documentation and by visualizers to represent the module's I/O interface. The actual communication paths are defined in the `links` section.
+
+Example:
+```json
+"modules": {
+  "serial": {
+    "driver": "osgar.drivers.logserial:LogSerial",
+    "init": { "port": "/dev/ttyUSB0", "speed": 4800 }
+  },
+  "gps": {
+    "driver": "osgar.drivers.gps:GPS",
+    "init": {}
+  },
+  "app": {
+    "driver": "application",
+    "init": {}
+  }
+},
+"links": [
+  ["serial.raw", "gps.raw"],
+  ["gps.position", "app.position"]
+]
+```
+In this setup, the `serial` module handles physical I/O, the `gps` module parses the raw stream, and the `app` module (using the special `"application"` driver placeholder) receives the high-level coordinates.
+
+## 3. Python Threads and Communication
+
+OSGAR's standard implementation uses Python's `threading.Thread`. Each module runs in its own thread, allowing for concurrent execution.
+
+-   **Standard Version**: Modules inherit from `osgar.node.Node`, which is a `threading.Thread`. They spend most of their time in a `listen()` loop, waiting for data on their input queue.
+-   **ZMQ Option**: While threads are the default, OSGAR also supports an alternative architecture using processes and communication via a **ZMQ router** (`osgar/zmqrouter.py`). This is useful for multi-language support or better process isolation, but the core principles of the bus remain the same.
+
+### Mandatory Stream Registration
+Every module MUST register its output channels in its `__init__` method using `self.bus.register()`.
+```python
+def __init__(self, config, bus):
+    super().__init__(config, bus)
+    bus.register('raw', 'status')
+```
+Failure to register a stream will result in an error when attempting to `publish()` to it, as the `LogWriter` needs to assign a unique stream ID for the log file at startup.
+
+## 4. Time Handling in OSGAR
+
+A critical rule in OSGAR is: **Never use system time (`time.time()`, `datetime.now()`) inside a module.**
+
+### Why Not System Time?
+1.  **Replayability**: OSGAR is designed so that a log file can be replayed exactly as it happened. If a module uses system time, the replay will use the "current" time during replay, breaking deterministic behavior.
+2.  **Simulation**: When running in a simulator, time might run faster or slower than real-time. Modules must stay synchronized with the simulator's clock.
+
+### Using the Bus Time
+Modules receive a timestamp with every message they `listen()` to. This timestamp should be stored in `self.time` and used for all time-based logic (e.g., timeouts, integration).
+```python
+def update(self):
+    timestamp, channel, data = self.bus.listen()
+    self.time = timestamp  # Update internal clock
+```
+
+### Simulation and Replay Mode
+The "no system time" rule is what makes OSGAR powerful for both simulation and debugging:
+-   **In Replay**: `LogReader` reads the original timestamps from the log file. When a module calls `listen()`, it receives the exact same `timedelta` that was recorded during the real run. The module "thinks" it is running in real-time, even if the replay is running much faster.
+-   **In Simulation**: A simulator driver can publish its own time (e.g., via a `sim_time_sec` channel). Other modules then synchronize their internal state to this published simulation time rather than the wall clock. This allows the simulation to run at any speed (or even pause) without affecting the robot's control logic.
+
+### Estimating Delay
+There are two ways to monitor delay in OSGAR:
+1.  **Module Processing Delay**: The `publish(channel, data)` function returns the timestamp assigned to the message by the logger. By comparing this returned timestamp with `self.time` (the timestamp of the triggering input message), a module can measure its own internal processing time.
+    ```python
+    def on_scan(self, data):
+        # ... heavy computation ...
+        publish_time = self.publish('processed_data', result)
+        delay = publish_time - self.time
+    ```
+2.  **Bus Queue Delay**: The `_BusHandler` internally tracks `max_delay`, which is the difference between the timestamp of data being published and the timestamp of the last data received by that module's `listen()` loop. A large delay here indicates that the module's input queue is backing up because it cannot process incoming messages fast enough.
+
+## 5. Serialization with Msgpack
+
+OSGAR uses `msgpack` for efficient binary serialization of messages.
+
+-   **Msgpack Extension**: OSGAR extends msgpack to support `numpy` arrays and transparent `zlib` compression for large data packets (like camera frames or lidar scans).
+-   **Lists vs. Tuples**: In Python, `list` and `tuple` are distinct types. However, `msgpack` serializes both into the same "array" structure. To avoid ambiguity and ensure consistency during replay, the convention in OSGAR is to **always use lists** for message payloads.
+
+## 6. External I/O Nodes (Hardware Interfacing)
+
+Nodes that interact with the real world (via Serial, Ethernet, CAN, etc.) are special because they often need to bridge the synchronous world of OSGAR with the asynchronous nature of hardware I/O.
+
+### The Two-Thread Pattern
+While standard `Node`s have one thread for the `listen()`/`update()` loop, complex I/O drivers like `LogSerial` or `LogSocket` often use **two threads**:
+
+1.  **Input Thread**: Constantly reads from the hardware (e.g., `com.read()`) and publishes `raw` data to the OSGAR bus as soon as it arrives.
+2.  **Output Thread**: Listens to the OSGAR bus for outgoing commands and writes them to the hardware (e.g., `com.write()`).
+
+This separation ensures that receiving data from a sensor is not blocked by waiting for a command to be published, and vice-versa.
+
+## 7. Module Linking and I/O Names
+
+Links in the configuration define how data flows between modules:
+
+```json
+"links": [
+  ["encoders.pose2d", "app.encoder_pose"],
+  ["lidar.pose2d", "app.lidar_pose"]
+]
+```
+
+Each link is a pair of `[sender.output_channel, receiver.input_channel]`. When a module calls `self.publish('channel', data)`, the `_BusHandler` identifies all connected receivers and puts the data into their respective input queues using the mapped input channel name.
+
+### Disambiguation and Handlers
+A key feature of OSGAR is that **output names and input names do not have to match**. This is essential for disambiguation when a module receives the same type of data from multiple sources.
+
+For example, an application might receive `pose2d` from both wheel encoders and a SLAM algorithm. By mapping these to unique input names (`encoder_pose` and `lidar_pose`), the receiving `Node` can trigger different handlers:
+
+-   **Output**: The sender uses its generic internal output name, e.g., `self.publish('pose2d', data)`.
+-   **Input**: The receiver (e.g., `Node`-based) uses the mapped input name to trigger the correct handler:
+    -   `on_encoder_pose(self, data)`
+    -   `on_lidar_pose(self, data)`
+
+Without this mapping, the `app` module would receive both streams on the same channel and would be unable to distinguish the source of the data.
+
+## 8. Storage in the Logfile
+
+Everything that passes through the bus is recorded in the logfile via `LogWriter`.
+
+-   **Stream Registration**: Each output channel (e.g., `gps.position`) is registered as a unique "stream" with an ID.
+-   **Record Format**: Each log entry consists of:
+    -   `stream_id`: Identifying the source.
+    -   `timestamp`: The time the data was published.
+    -   `data`: Serialized (and optionally compressed) payload.
+-   **Replayability**: Because the configuration and all bus messages are logged, the exact state and behavior of the system can be reproduced by "replaying" the log.
+
+## 9. Input Queue Implementation
+
+Each module's `_BusHandler` contains a `queue.Queue()` (a thread-safe FIFO queue).
+
+1.  **Publishing**: When `publish(channel, data)` is called:
+    -   The data is serialized and written to the log.
+    -   The data (wrapped with its timestamp and target input channel name) is `put()` into the `queue` of every connected module.
+2.  **Listening**: When a module calls `listen()` (or `update()`):
+    -   It calls `self.queue.get()`, which blocks until data is available.
+    -   The module then processes the message, typically by calling a handler method (e.g., `on_position`).
+
+This architecture ensures that modules are decoupled and that data is processed in the order it was received, with the system log serving as a perfect record of all interactions.
+
+## 10. Reading OSGAR Logfiles
+
+There are two primary ways to interact with OSGAR logfiles: programmatically using the Python API or via the command-line utility.
+
+### Programmatic Reading (Python API)
+You can use `osgar.logger.LogReader` to iterate over all messages in a log file. To handle the data correctly, you should use `osgar.lib.serialize.deserialize`.
+
+```python
+from osgar.logger import LogReader, lookup_stream_names
+from osgar.lib.serialize import deserialize
+
+logfile = 'myrobot-260310_123456.log'
+names = lookup_stream_names(logfile)
+
+with LogReader(logfile) as log:
+    for timestamp, stream_id, raw_data in log:
+        if stream_id == 0:
+            # Stream 0 contains system info/metadata
+            continue
+
+        channel_name = names[stream_id - 1]
+        data = deserialize(raw_data)
+        print(f"{timestamp} | {channel_name} | {data}")
+```
+
+### Advanced Log Reading
+The `LogReader` class supports several parameters for fine-grained control over how data is extracted:
+
+-   **`only_stream_id`**: Can be a single integer or a list of IDs. If provided, the reader will only yield messages from these specific streams.
+-   **`clip_start_time_sec`**: Skips all messages before this relative timestamp (in seconds).
+-   **`clip_end_time_sec`**: Stops reading after this relative timestamp (in seconds).
+-   **`follow`**: If set to `True`, the reader will not stop at the end of the file. Instead, it will wait (block) for new data to be written, behaving similarly to the `tail -f` command. This is useful for real-time monitoring of a running OSGAR process.
+
+To convert a human-readable stream name (like `gps.position`) into its numeric ID, use the `lookup_stream_id` utility:
+
+```python
+from osgar.logger import LogReader, lookup_stream_id
+
+logfile = 'my_run.log'
+gps_id = lookup_stream_id(logfile, 'gps.position')
+
+with LogReader(logfile, only_stream_id=gps_id, clip_start_time_sec=10.0) as log:
+    for timestamp, stream_id, raw_data in log:
+        # Processes only GPS data starting from 10s into the log
+        pass
+```
+
+### Command-Line Utility (`osgar.logger`)
+The `osgar.logger` module can be executed directly to inspect logfiles. It is particularly useful for extracting specific streams or generating formatted text output.
+
+-   **List Streams**: `python -m osgar.logger --list-names <logfile>`
+-   **Extract Data**: `python -m osgar.logger --stream gps.position <logfile>`
+-   **Formatted Output**: Use the `--format` flag to create custom text representations. Available fields include `sec`, `timestamp`, `stream_id`, and `data`.
+
+```bash
+# Example: Print time and position in a CSV-like format
+python -m osgar.logger --stream gps.position --format "{sec}, {data[0]}, {data[1]}" my_run.log
+```
+
+This utility is invaluable for quick data verification and for exporting data to other tools (like Excel or MATLAB) for further analysis.