diff --git a/THINKING_MODE.md b/THINKING_MODE.md
new file mode 100644
index 0000000..5bb7938
--- /dev/null
+++ b/THINKING_MODE.md
@@ -0,0 +1,201 @@
+# Thinking Mode with Adapters
+
+This document explains how the thinking mode (`<think>` tags) works in NexusAI, especially when using fine-tuned adapters.
+
+---
+
+## Overview
+
+NexusAI supports a "thinking mode" where the model shows its reasoning process before responding:
+
+```
+<think>User is asking about electricity. I should explain in Tesla's voice...</think>
+Alternating current flows in harmony with nature's rhythms...
+```
+
+This feature works differently depending on whether you're using the **base model** or a **fine-tuned adapter**.
+
+---
+
+## How It Works
+
+### 1. Base Model (No Adapter)
+
+When no adapter is loaded:
+- Uses **Qwen's native thinking** via `enable_thinking=True` in the chat template
+- Adds thinking instructions to the system prompt
+- Includes a one-shot example to guide the format
+
+The model generates its own reasoning style.
+
+### 2. Adapter WITHOUT Thinking Support
+
+When an adapter is loaded but wasn't trained with `<think>` tags:
+- Thinking mode is **automatically disabled**
+- The toggle button turns amber and is locked
+- Model uses direct response format
+
+This prevents the model from generating incomplete responses (stopping after `</think>`).
+
+### 3. Adapter WITH Thinking Support
+
+When an adapter is trained with `<think>` tags in the training data:
+- Check "Adapter trained with `<think>` format" when loading
+- Qwen's native thinking is **disabled** (`enable_thinking=False`)
+- No thinking instructions added to prompt
+- The adapter generates thinking **naturally from its training**
+
+This ensures the adapter uses its own trained thinking style (e.g., Tesla's voice) rather than Qwen's generic reasoning.
+
+---
+
+## Training Data Format
+
+### Standard Format (No Thinking)
+
+```json
+{"prompt": "Hello", "response": "Hi there! How can I help?", "score": 10}
+```
+
+### Thinking Format
+
+```json
+{"prompt": "Hello", "response": "<think>User greeted me warmly.</think>Hi there! How can I help?", "score": 10}
+```
+
+### Example: Tesla Persona with Thinking
+
+```json
+{"prompt": "Hello Tesla", "response": "<think>A visitor greets me. I shall welcome them in my characteristic manner.</think>Greetings, seeker of truth. What stirs your mind today?", "score": 10}
+{"prompt": "Tell me about AC", "response": "<think>They wish to learn of alternating current. I shall explain with passion.</think>Alternating current flows in harmony with nature's rhythms—efficient and transformable.", "score": 10}
+```
+
+---
+
+## Technical Implementation
+
+### Backend Logic (`main.py`)
+
+The chat handler determines thinking mode based on three scenarios:
+
+```python
+# 1. No adapter + thinking enabled → use Qwen native thinking
+use_native_thinking = request.enable_thinking and not state.adapter_loaded
+
+# 2. Adapter with thinking support → let adapter handle it
+adapter_handles_thinking = state.adapter_loaded and state.adapter_supports_thinking
+
+# 3. Adapter without thinking support → direct response only
+use_direct_response = state.adapter_loaded and not state.adapter_supports_thinking
+```
+
+#### Chat Template Parameters
+
+| Scenario | `enable_thinking` | Prompt Modification |
+|----------|-------------------|---------------------|
+| Base model + thinking | `True` | Add instructions + one-shot |
+| Adapter with thinking | `False` | None (adapter trained) |
+| Adapter without thinking | `False` | "Answer directly..." |
+| User disabled thinking | `False` | "Answer directly..." |
+
+### API Changes
+
+#### Load Adapter Request
+
+```json
+POST /v1/adapter/load
+{
+  "adapter_name": "tesla_adapter",
+  "system_prompt": "You are Nikola Tesla...",
+  "supports_thinking": true
+}
+```
+
+#### Model Status Response
+
+```json
+GET /v1/model/status
+{
+  "adapter_loaded": true,
+  "adapter_supports_thinking": true,
+  "thinking_supported": true,
+  ...
+}
+```
+
+### Frontend Changes
+
+- Added checkbox: "Adapter trained with `<think>` format"
+- Thinking toggle enabled when:
+  - No adapter loaded, OR
+  - Adapter loaded with `supports_thinking=true`
+- Thinking toggle disabled (amber) when:
+  - Adapter loaded without thinking support
+
+---
+
+## Why This Design?
+
+### Problem: Qwen's Native Thinking Conflicts with Trained Adapters
+
+Qwen3 models have built-in thinking support. When `enable_thinking=True`:
+- Qwen generates its **own** reasoning style
+- This overrides whatever the adapter was trained on
+- Result: Generic reasoning instead of persona-specific thinking
+
+### Solution: Let Adapters Control Their Own Thinking
+
+When an adapter is trained with `<think>` tags:
+1. Disable Qwen's native thinking (`enable_thinking=False`)
+2. Don't add any thinking instructions to the prompt
+3. The adapter naturally generates `<think>...</think>response` from training
+
+This preserves the adapter's unique voice and reasoning style.
+
+---
+
+## Quick Reference
+
+| State | Thinking Toggle | Behavior |
+|-------|-----------------|----------|
+| Base model | Enabled | Qwen native thinking |
+| Base model | Disabled | Direct response |
+| Adapter (no thinking) | Locked/Disabled | Direct response |
+| Adapter (with thinking) | Enabled | Adapter's trained thinking |
+| Adapter (with thinking) | Disabled | Direct response |
+
+---
+
+## Files Modified
+
+- `main.py` — Backend logic for thinking mode
+- `nexus-lab-ui/src/App.jsx` — Frontend checkbox and toggle logic
+- `training_data.jsonl` — Example data with `<think>` tags
+
+---
+
+## Troubleshooting
+
+### Adapter generates Qwen-style thinking instead of trained style
+
+**Cause:** `enable_thinking=True` is being passed to Qwen's chat template.
+
+**Fix:** Ensure "Adapter trained with `<think>` format" is checked when loading.
+
+### Model stops after `</think>` with no response
+
+**Cause:** Adapter wasn't trained with `<think>` tags but thinking mode is enabled.
+
+**Fix:** Either:
+1. Uncheck "Adapter trained with `<think>` format", or
+2. Retrain the adapter with `<think>` tags in responses
+
+### Thinking toggle is locked/amber
+
+**Expected:** This happens when an adapter without thinking support is loaded.
+
+**To enable:** Load an adapter trained with thinking, or unload the adapter.
+
+---
+
+*Last updated: January 2026*
diff --git a/main.py b/main.py
index a57e3ed..85c7646 100644
--- a/main.py
+++ b/main.py
@@ -42,6 +42,7 @@ def __init__(self):
         self.model_name = None
         self.adapter_loaded = False
         self.active_adapter = None
+        self.adapter_supports_thinking = False  # True if adapter was trained with <think> tags
         # Default system prompt
         self.system_prompt = "You are a helpful AI assistant."
 
@@ -156,6 +157,7 @@ class ModelParamsRequest(BaseModel):
 class LoadAdapterRequest(BaseModel):
     system_prompt: str = ""
     adapter_name: str = ""
+    supports_thinking: bool = False  # True if adapter was trained with <think> format
 
 @app.post("/v1/adapter/load")
 async def load_adapter_handler(request: LoadAdapterRequest):
@@ -187,9 +189,10 @@ async def load_adapter_handler(request: LoadAdapterRequest):
         state.model.eval()
         state.adapter_loaded = True
         state.active_adapter = request.adapter_name or "Legacy Adapter"
+        state.adapter_supports_thinking = request.supports_thinking
         state.system_prompt = request.system_prompt if request.system_prompt else "You are a helpful AI assistant."
-        print(f"Adapter loaded. System Prompt: {state.system_prompt}")
-        return {"status": "Adapter loaded", "adapter": state.active_adapter}
+        print(f"Adapter loaded. System Prompt: {state.system_prompt}, Supports Thinking: {state.adapter_supports_thinking}")
+        return {"status": "Adapter loaded", "adapter": state.active_adapter, "supports_thinking": state.adapter_supports_thinking}
     except Exception as e:
         print(f"Error loading adapter: {e}")
         state.adapter_loaded = False
@@ -387,8 +390,9 @@ async def get_model_status():
         "current_model": state.model_name,
         "active_adapter": state.active_adapter,
         "adapter_loaded": state.adapter_loaded,
-        # Thinking mode is disabled when an adapter is loaded (adapters aren't trained on <think> format)
-        "thinking_supported": not state.adapter_loaded,
+        "adapter_supports_thinking": state.adapter_supports_thinking,
+        # Thinking is supported if: no adapter loaded, OR adapter was trained with thinking
+        "thinking_supported": not state.adapter_loaded or state.adapter_supports_thinking,
     }
 
 @app.post("/v1/model/unload")
@@ -449,6 +453,7 @@ async def unload_adapter_handler():
         state.model.eval()
         state.adapter_loaded = False
         state.active_adapter = None
+        state.adapter_supports_thinking = False
         state.system_prompt = "You are a helpful AI assistant."
         print("Adapter unloaded. Reverted to base model.")
         return {"status": "Adapter unloaded"}
@@ -527,20 +532,53 @@ async def chat_handler(request: ChatRequest):
         if state.tokenizer.chat_template:
             messages = [{"role": "system", "content": state.system_prompt}]
             
-            # When an adapter is loaded, skip thinking injection — adapters are trained
-            # on direct prompt→response without <think> tags, so they stop after </think>.
-            use_thinking = request.enable_thinking and not state.adapter_loaded
+            # Determine thinking mode based on adapter state:
+            # 1. No adapter + thinking enabled → use Qwen native thinking
+            # 2. Adapter with thinking support → let adapter handle it (no native thinking, no prompt modification)
+            # 3. Adapter without thinking support → direct response only
             
-            if use_thinking:
+            adapter_handles_thinking = state.adapter_loaded and state.adapter_supports_thinking
+            use_native_thinking = request.enable_thinking and not state.adapter_loaded
+            use_direct_response = state.adapter_loaded and not state.adapter_supports_thinking
+            
+            print(f"[DEBUG] adapter_loaded={state.adapter_loaded}, adapter_supports_thinking={state.adapter_supports_thinking}, "
+                  f"request.enable_thinking={request.enable_thinking}, adapter_handles_thinking={adapter_handles_thinking}, "
+                  f"use_native_thinking={use_native_thinking}, use_direct_response={use_direct_response}")
+            
+            if use_native_thinking:
+                # No adapter: use Qwen's native thinking with prompt instructions
                 messages[0]["content"] += "\n\nYou MUST begin by reasoning step-by-step inside <think>...</think> tags. Do NOT speak to the user inside the tags. usage: <think>internal thought</think> final response"
-                # One-shot example to guide the model
                 messages.append({"role": "user", "content": "Hello"})
                 messages.append({"role": "assistant", "content": "<think>The user is greeting me. I should respond in character.</think>Greetings. I am ready to assist."})
+            elif adapter_handles_thinking:
+                # Adapter trained with thinking: let it handle naturally, no modifications needed
+                # The adapter learned <think>...</think>response format from training data
+                pass
+            elif use_direct_response:
+                # Adapter without thinking support: force direct response
+                messages[0]["content"] += "\n\nAnswer directly without showing your thinking process."
             else:
+                # User disabled thinking, no adapter
                 messages[0]["content"] += "\n\nAnswer directly without showing your thinking process."
 
             messages.append({"role": "user", "content": request.message})
-            input_ids = state.tokenizer.apply_chat_template(messages, return_tensors="pt", add_generation_prompt=True).to(state.model.device)
+            
+            # Build kwargs for apply_chat_template
+            chat_template_kwargs = {
+                "return_tensors": "pt",
+                "add_generation_prompt": True,
+            }
+            # Qwen3 native enable_thinking: only use when no adapter and user wants thinking
+            # For adapters with thinking support, set False so adapter's trained format is used
+            chat_template_kwargs["enable_thinking"] = use_native_thinking
+            
+            try:
+                input_ids = state.tokenizer.apply_chat_template(messages, **chat_template_kwargs).to(state.model.device)
+            except TypeError:
+                # Tokenizer doesn't support enable_thinking param — use standard call
+                input_ids = state.tokenizer.apply_chat_template(
+                    messages, return_tensors="pt", add_generation_prompt=True
+                ).to(state.model.device)
             # Explicit attention_mask (all 1s for single sequence) so the model doesn't warn when pad_token_id == eos_token_id
             attention_mask = input_ids.new_ones(input_ids.shape, dtype=torch.long)
         else:
diff --git a/nexus-lab-ui/src/App.jsx b/nexus-lab-ui/src/App.jsx
index 8b5a313..b120750 100644
--- a/nexus-lab-ui/src/App.jsx
+++ b/nexus-lab-ui/src/App.jsx
@@ -41,6 +41,7 @@ export default function App() {
   const [adapterName, setAdapterName] = useState("my_adapter");
   const [availableAdapters, setAvailableAdapters] = useState([]);
   const [selectedAdapter, setSelectedAdapter] = useState("");
+  const [adapterSupportsThinking, setAdapterSupportsThinking] = useState(false);
   const [enableThinking, setEnableThinking] = useState(true);
 
   // Sidebar Resizing State
@@ -357,17 +358,21 @@ export default function App() {
           </div>
           <div className="flex items-center gap-4">
             <button
-              onClick={() => !activeAdapter && setEnableThinking(!enableThinking)}
+              onClick={() => {
+                // Allow toggle if: no adapter, OR adapter supports thinking
+                const canToggle = !activeAdapter || adapterSupportsThinking;
+                if (canToggle) setEnableThinking(!enableThinking);
+              }}
               className={`p-2 rounded-lg transition-colors ${
-                activeAdapter
+                activeAdapter && !adapterSupportsThinking
                   ? 'bg-amber-100 text-amber-500 dark:bg-amber-900/30 dark:text-amber-400 cursor-not-allowed'
                   : enableThinking
                     ? 'bg-indigo-100 text-indigo-600 dark:bg-indigo-900/30 dark:text-indigo-400'
                     : 'bg-gray-100 dark:bg-slate-700 text-gray-400 dark:text-gray-500 hover:bg-gray-200 dark:hover:bg-slate-600'
               }`}
               title={
-                activeAdapter
-                  ? "Thinking disabled (adapter loaded — adapters use direct response)"
+                activeAdapter && !adapterSupportsThinking
+                  ? "Thinking disabled (adapter not trained with <think> format)"
                   : enableThinking
                     ? "Thinking Enabled"
                     : "Thinking Disabled"
@@ -675,6 +680,17 @@ export default function App() {
                             className="w-full text-xs p-2 mb-2 border dark:border-slate-600 rounded-lg bg-gray-50 dark:bg-slate-900 dark:text-gray-200 focus:outline-none focus:ring-2 focus:ring-purple-500 h-16"
                             placeholder="System Prompt (Define Persona)..."
                           />
+
+                          <label className="flex items-center gap-2 text-[10px] text-slate-500 dark:text-slate-400 mb-2 cursor-pointer select-none">
+                            <input
+                              type="checkbox"
+                              checked={adapterSupportsThinking}
+                              onChange={(e) => setAdapterSupportsThinking(e.target.checked)}
+                              className="w-3.5 h-3.5 rounded border-slate-300 dark:border-slate-600 text-purple-600 focus:ring-purple-500"
+                            />
+                            <span>Adapter trained with <code className="bg-slate-100 dark:bg-slate-700 px-1 rounded">&lt;think&gt;</code> format</span>
+                          </label>
+
                           <button
                             onClick={async () => {
                               if (!selectedAdapter && availableAdapters.length > 0) {
@@ -688,7 +704,8 @@ export default function App() {
                                   headers: { 'Content-Type': 'application/json' },
                                   body: JSON.stringify({
                                     system_prompt: systemPrompt,
-                                    adapter_name: selectedAdapter === "(Legacy Root Adapter)" ? "" : selectedAdapter
+                                    adapter_name: selectedAdapter === "(Legacy Root Adapter)" ? "" : selectedAdapter,
+                                    supports_thinking: adapterSupportsThinking
                                   })
                                 });
                                 const data = await res.json();
@@ -710,6 +727,7 @@ export default function App() {
                             try {
                               await fetch(`${API_BASE}/v1/adapter/unload`, { method: 'POST' });
                               setActiveAdapter(null);
+                              setAdapterSupportsThinking(false);
                             } catch (e) { alert("Error unloading adapter"); }
                           }}
                           className="w-full py-2 rounded-lg text-xs font-bold bg-purple-100 dark:bg-purple-900/30 text-purple-700 dark:text-purple-400 hover:bg-purple-200 transition-colors"
@@ -823,6 +841,22 @@ export default function App() {
                           <span className="text-[10px] text-amber-600 dark:text-amber-300">Diverse examples are better. Don't just repeat "Hello". Cover different topics.</span>
                         </div>
                       </div>
+
+                      <div className="p-3 bg-purple-50 dark:bg-purple-900/10 rounded border border-purple-100 dark:border-purple-900/30 mt-3">
+                        <div className="text-[10px] font-bold text-purple-700 dark:text-purple-400 mb-2 flex items-center gap-1">
+                          <BrainCircuit size={12} /> Training with Thinking Support
+                        </div>
+                        <p className="text-[10px] text-purple-600 dark:text-purple-300 mb-2">
+                          To use thinking mode with your adapter, include <code className="bg-purple-100 dark:bg-purple-900/50 px-1 rounded">&lt;think&gt;</code> tags in your training responses:
+                        </p>
+                        <code className="text-[9px] font-mono text-purple-700 dark:text-purple-300 block whitespace-pre-wrap bg-white dark:bg-slate-800 p-2 rounded border border-purple-200 dark:border-purple-800">
+{`{"prompt": "what's 2+2?", "response": "<think>Simple math.</think>That's 4!", "score": 10}
+{"prompt": "hello", "response": "<think>User greeted me.</think>Hey! How are you?", "score": 10}`}
+                        </code>
+                        <p className="text-[9px] text-purple-500 dark:text-purple-400 mt-2">
+                          Then check "<strong>Adapter trained with &lt;think&gt; format</strong>" when loading the adapter.
+                        </p>
+                      </div>
                     </div>
                   </div>
 
@@ -950,6 +984,78 @@ export default function App() {
                     </div>
                   </div>
 
+                  {/* Section 4: How Adapters Work */}
+                  <div>
+                    <div className="text-xs font-bold text-emerald-600 dark:text-emerald-400 uppercase tracking-wider mb-2 flex items-center gap-2">
+                      <Zap size={14} /> 4. How Adapters Work
+                    </div>
+                    <div className="bg-white dark:bg-slate-800 border dark:border-slate-700 rounded-lg p-4 shadow-sm space-y-3">
+                      <p className="text-[11px] text-gray-600 dark:text-gray-300 leading-relaxed">
+                        Adapters are lightweight LoRA weights (~10MB) that modify the base model's behavior without replacing it. Here's how they work in NexusAI:
+                      </p>
+
+                      <div className="space-y-2">
+                        <div className="flex gap-3 items-start">
+                          <div className="w-6 h-6 rounded-full bg-emerald-100 dark:bg-emerald-900/30 flex items-center justify-center shrink-0 mt-0.5">
+                            <span className="text-[10px] font-bold text-emerald-600">1</span>
+                          </div>
+                          <div>
+                            <div className="text-xs font-bold text-slate-700 dark:text-slate-200">Training Format</div>
+                            <p className="text-[10px] text-slate-500 dark:text-slate-400">
+                              Adapters are trained on <strong>direct prompt → response</strong> pairs from your JSONL data. They learn your style, tone, and content without any special formatting.
+                            </p>
+                          </div>
+                        </div>
+
+                        <div className="flex gap-3 items-start">
+                          <div className="w-6 h-6 rounded-full bg-emerald-100 dark:bg-emerald-900/30 flex items-center justify-center shrink-0 mt-0.5">
+                            <span className="text-[10px] font-bold text-emerald-600">2</span>
+                          </div>
+                          <div>
+                            <div className="text-xs font-bold text-slate-700 dark:text-slate-200">Load & Unload</div>
+                            <p className="text-[10px] text-slate-500 dark:text-slate-400">
+                              <strong>Load:</strong> Adapter weights are merged with the base model for inference.<br />
+                              <strong>Unload:</strong> Reverts to the pure base model — no adapter influence remains.
+                            </p>
+                          </div>
+                        </div>
+
+                        <div className="flex gap-3 items-start">
+                          <div className="w-6 h-6 rounded-full bg-amber-100 dark:bg-amber-900/30 flex items-center justify-center shrink-0 mt-0.5">
+                            <span className="text-[10px] font-bold text-amber-600">!</span>
+                          </div>
+                          <div>
+                            <div className="text-xs font-bold text-slate-700 dark:text-slate-200">Thinking Mode & Adapters</div>
+                            <p className="text-[10px] text-slate-500 dark:text-slate-400">
+                              <strong>Thinking is automatically disabled</strong> when an adapter is loaded. Why? Adapters are trained on direct responses — they don't know about <code className="bg-slate-100 dark:bg-slate-700 px-1 rounded">&lt;think&gt;...&lt;/think&gt;</code> tags and will stop generating after them.
+                            </p>
+                          </div>
+                        </div>
+                      </div>
+
+                      <div className="p-3 bg-slate-50 dark:bg-slate-900 rounded border border-slate-100 dark:border-slate-700 mt-2">
+                        <div className="text-[10px] font-bold text-emerald-600 dark:text-emerald-400 mb-1">Want Thinking with Your Adapter?</div>
+                        <p className="text-[10px] text-slate-500 dark:text-slate-400">
+                          Include <code className="bg-slate-100 dark:bg-slate-700 px-1 rounded">&lt;think&gt;</code> examples in your training data:
+                        </p>
+                        <code className="text-[9px] font-mono text-emerald-600 dark:text-emerald-400 block mt-1 whitespace-pre-wrap">
+                          {`{"prompt": "hi", "response": "<think>User greeted me...</think>Hey! How are you?"}`}
+                        </code>
+                      </div>
+
+                      <div className="grid grid-cols-2 gap-2 mt-2">
+                        <div className="p-2 bg-emerald-50 dark:bg-emerald-900/10 rounded border border-emerald-100 dark:border-emerald-900/30">
+                          <span className="text-[10px] font-bold text-emerald-700 dark:text-emerald-400 block mb-1">Base Model</span>
+                          <span className="text-[10px] text-emerald-600 dark:text-emerald-300">Thinking mode available. General-purpose responses.</span>
+                        </div>
+                        <div className="p-2 bg-purple-50 dark:bg-purple-900/10 rounded border border-purple-100 dark:border-purple-900/30">
+                          <span className="text-[10px] font-bold text-purple-700 dark:text-purple-400 block mb-1">With Adapter</span>
+                          <span className="text-[10px] text-purple-600 dark:text-purple-300">Direct responses only. Custom style/persona active.</span>
+                        </div>
+                      </div>
+                    </div>
+                  </div>
+
                 </div>
               ) : (
                 <div className="space-y-3">