-
Notifications
You must be signed in to change notification settings - Fork 0
Add three new feature articles from Jan/Feb 2026 Databricks releases #1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| @@ -0,0 +1,88 @@ | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| --- | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| title: "Getting Started with Managed MCP Servers on Databricks" | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| description: "Learn how Managed MCP Servers let your AI agents securely connect to Databricks resources and external APIs using the Model Context Protocol." | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| date-modified: "07/02/2026" | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| date-format: "DD/MM/YYYY" | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| categories: [AI, Agents, Unity Catalog] | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| toc: true | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| toc-title: Navigation | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| tags: | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - databricks | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - mcp | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - ai-agents | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - unity-catalog | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - generative-ai | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - tips | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| draft: false | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| --- | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ## Summary | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| - Managed MCP Servers provide ready-to-use connections between your AI agents and Databricks resources like Unity Catalog, Vector Search, and Genie | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - Four server types are available out of the box: Unity Catalog Functions, Vector Search, Genie Space, and DBSQL | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| - All access is governed by Unity Catalog permissions, so agents can only reach data they are authorised to use | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ## Introduction | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| Building AI agents that can interact with your data platform has historically meant writing and maintaining custom tool integrations for every resource your agent needs to reach. With Managed MCP Servers, now in Public Preview as of January 2026, Databricks provides a standardised way for agents to connect to platform resources without custom plumbing. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| The [Model Context Protocol](https://modelcontextprotocol.io/) (MCP) is an open-source standard that connects AI agents to tools, resources, and contextual information. The key benefit is standardisation: you build a tool once and any MCP-compatible agent can use it, whether it is something you have built yourself or a third-party agent like Claude Code, Cursor, or Codex. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ## What Managed MCP Servers Are Available? | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| Databricks provides four ready-to-use server types: | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| | Server Type | What It Does | Access Mode | | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| |---|---|---| | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| | **Unity Catalog Functions** | Run predefined SQL queries as agent tools | Read | | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| | **Vector Search** | Query Vector Search indexes to retrieve relevant documents | Read | | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| | **Genie Space** | Analyse structured data using natural language via Genie | Read | | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| | **DBSQL** | Run AI-generated SQL to author data pipelines | Read & Write | | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| Each server enforces Unity Catalog permissions at every call. If a user does not have access to a table, neither does their agent. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ::: {.callout-tip title="Pro Tip" appearance="simple"} | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| You can connect an agent to multiple servers simultaneously. For example, a customer support agent could use Vector Search for ticket retrieval, Genie for billing queries, and UC Functions for account operations — all in a single conversation. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| ::: | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ## Connecting to a Managed MCP Server | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| To get started locally, you need Python 3.12+ and the `databricks-mcp` package. Authentication uses OAuth via the Databricks SDK. | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ``` python | ||||||||||||||||||||||||||||||||||||||||||||||||||||
| pip install databricks-mcp mcp>=1.9 databricks-sdk[openai] mlflow>=3.1.0 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Comment on lines
+52
to
+53
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
| ``` python | |
| pip install databricks-mcp mcp>=1.9 databricks-sdk[openai] mlflow>=3.1.0 | |
| ``` bash | |
| pip install "databricks-mcp" "mcp>=1.9" "databricks-sdk[openai]" "mlflow>=3.1.0" |
Copilot
AI
Feb 9, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
await client.list_tools() is used at top-level; this will raise a SyntaxError in standard Python (outside an async function). Wrap the example in an async def main() and run it via asyncio.run(...), or note that it must be executed in a notebook/REPL that supports top-level await.
| from databricks_mcp import DatabricksMCPClient | |
| # The client authenticates via Databricks SDK OAuth | |
| client = DatabricksMCPClient(workspace_url="https://your-workspace.databricks.com") | |
| # Dynamically discover available tools | |
| tools = await client.list_tools() | |
| for tool in tools: | |
| print(f"Tool: {tool.name} - {tool.description}") | |
| import asyncio | |
| from databricks_mcp import DatabricksMCPClient | |
| async def main() -> None: | |
| # The client authenticates via Databricks SDK OAuth | |
| client = DatabricksMCPClient(workspace_url="https://your-workspace.databricks.com") | |
| # Dynamically discover available tools | |
| tools = await client.list_tools() | |
| for tool in tools: | |
| print(f"Tool: {tool.name} - {tool.description}") | |
| if __name__ == "__main__": | |
| asyncio.run(main()) |
| Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
| @@ -0,0 +1,132 @@ | ||||||
| --- | ||||||
| title: "Storing MLflow Traces in Unity Catalog" | ||||||
| description: "Learn how to store MLflow traces in Unity Catalog tables using OpenTelemetry format for governed, queryable observability of your AI applications." | ||||||
| date-modified: "07/02/2026" | ||||||
| date-format: "DD/MM/YYYY" | ||||||
| categories: [MLflow, Unity Catalog, Observability] | ||||||
| toc: true | ||||||
| toc-title: Navigation | ||||||
| tags: | ||||||
| - databricks | ||||||
| - mlflow | ||||||
| - unity-catalog | ||||||
| - opentelemetry | ||||||
| - observability | ||||||
| - generative-ai | ||||||
| - tips | ||||||
| draft: false | ||||||
| --- | ||||||
|
|
||||||
| ## Summary | ||||||
|
|
||||||
| - Store MLflow traces as Delta tables in Unity Catalog for unlimited retention, SQL querying, and governed access control | ||||||
| - Traces use OpenTelemetry-compatible format, making them interoperable with your existing observability stack | ||||||
| - Access control is managed through Unity Catalog permissions rather than experiment-level ACLs | ||||||
|
|
||||||
| ## Introduction | ||||||
|
|
||||||
| If you are building generative AI applications on Databricks, observability matters. You need to understand what your models and agents are doing, how they are performing, and where things go wrong. MLflow has long provided tracing capabilities, but traces were historically tied to MLflow experiments with limited querying and access control options. | ||||||
|
|
||||||
| As of January 2026, you can now store MLflow traces directly in Unity Catalog tables using an OpenTelemetry-compatible (OTEL) format. This means your trace data lives alongside the rest of your governed data assets — queryable with SQL, secured with UC permissions, and stored in Delta tables with unlimited retention. This feature is currently in Beta. | ||||||
|
|
||||||
| ## Why Store Traces in Unity Catalog? | ||||||
|
|
||||||
| Compared to the default experiment-based storage, Unity Catalog trace storage gives you several advantages: | ||||||
|
|
||||||
| - **Governed access control** — permissions are managed through UC schema and table-level grants, not experiment ACLs | ||||||
| - **SQL queryability** — query trace data directly from any Databricks SQL warehouse | ||||||
| - **Unlimited storage** — Delta tables handle long-term retention without the constraints of experiment storage | ||||||
| - **Broad visibility** — anyone with table access can view traces regardless of which experiment produced them | ||||||
| - **OTEL compatibility** — trace IDs use URI format, improving interoperability with external observability tools | ||||||
|
|
||||||
| ## Getting Started | ||||||
|
|
||||||
| You will need MLflow 3.9.0 or later and a Unity Catalog-enabled workspace. | ||||||
|
|
||||||
| ``` python | ||||||
|
||||||
| ``` python | |
| ```bash |
| Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
| @@ -0,0 +1,136 @@ | ||||||
| --- | ||||||
| title: "Monitoring Your Jobs with Lakeflow System Tables" | ||||||
| description: "Learn how to use the now GA Lakeflow system tables to monitor job runs, track task performance, and analyse compute costs across your Databricks account." | ||||||
| date-modified: "07/02/2026" | ||||||
| date-format: "DD/MM/YYYY" | ||||||
| categories: [Lakeflow, Observability, Data Engineering] | ||||||
| toc: true | ||||||
| toc-title: Navigation | ||||||
| tags: | ||||||
| - databricks | ||||||
| - lakeflow | ||||||
| - jobs | ||||||
| - system-tables | ||||||
| - monitoring | ||||||
| - observability | ||||||
| - tips | ||||||
| draft: false | ||||||
| --- | ||||||
|
|
||||||
| ## Summary | ||||||
|
|
||||||
| - The Lakeflow system tables `jobs`, `job_tasks`, `job_run_timeline`, and `job_task_run_timeline` are now Generally Available as of January 2026 | ||||||
| - These tables provide account-wide visibility into all job definitions, run history, and task-level execution metrics | ||||||
| - Join with `billing.usage` to calculate cost per job run for precise spend attribution | ||||||
|
||||||
| - Join with `billing.usage` to calculate cost per job run for precise spend attribution | |
| - Join with `system.billing.usage` to calculate cost per job run for precise spend attribution |
Copilot
AI
Feb 9, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The table header rows start with ||, which will render an extra empty column (or break table parsing) in most Markdown/Quarto renderers. Use standard table syntax with a single leading | for each row.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The table header rows start with
||, which likely produces an unintended empty first column in rendered output. Use standard Markdown/Quarto table syntax with a single leading|per row.