
Marimo Flow 🌊

Interactive ML notebooks with reactive updates, AI assistance, and MLflow tracking




Like marimo algae drifting in crystal waters, your code flows and evolves – each cell a living sphere of computation, gently touching others, creating ripples of reactive change. In this digital ocean, data streams like currents, models grow like organic formations, and insights emerge naturally from the depths. Let your ML experiments flow freely, tracked and nurtured, as nature intended.

Demo video: marimo-flow.mp4

Why Marimo Flow is Powerful 🚀

Marimo Flow combines reactive notebook development with AI-powered assistance and robust ML experiment tracking:

  • 🤖 AI-First Development with MCP: Model Context Protocol (MCP) integration brings live documentation, code examples, and AI assistance directly into your notebooks - access up-to-date library docs for Marimo, Polars, Plotly, and more without leaving your workflow
  • 🔄 Reactive Execution: Marimo's dataflow graph ensures your notebooks are always consistent - change a parameter and watch your entire pipeline update automatically
  • 📊 Seamless ML Pipeline: MLflow integration tracks every experiment, model, and metric without breaking your flow
  • 🎯 Interactive Development: Real-time parameter tuning with instant feedback and beautiful visualizations

This combination eliminates the reproducibility issues of traditional notebooks while providing AI-enhanced, enterprise-grade experiment tracking.
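
For example, a minimal sketch of the reactive pattern (hypothetical cells, not taken from a repository notebook): a slider defined in one cell drives every cell that reads its value, so moving the slider re-runs only the affected cells.

# Cell 1: define an interactive control
import marimo as mo

n_samples = mo.ui.slider(100, 10_000, value=1_000, label="Samples")
n_samples

# Cell 2: reads n_samples.value, so it re-runs whenever the slider changes
import numpy as np

data = np.random.default_rng(0).normal(size=n_samples.value)
mo.md(f"Mean of {n_samples.value} samples: {data.mean():.3f}")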

Features ✨

🤖 AI-Powered Development (MCP)

  • Model Context Protocol Integration: Live documentation and AI assistance in your notebooks
  • Context7 Server: Access up-to-date docs for any Python library without leaving marimo
  • Marimo MCP Server: Specialized assistance for marimo patterns and best practices
  • Local LLM Support: Ollama integration for privacy-focused AI code completion

📊 ML Development Workflow

  • 📓 Reactive Notebooks: Git-friendly .py notebooks with automatic dependency tracking
  • 🔬 MLflow Tracking: Complete ML lifecycle management with model registry
  • 🎯 Interactive Development: Real-time parameter tuning with instant visual feedback
  • 💾 SQLite Backend: Lightweight, file-based storage for experiments
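
A minimal tracking sketch against the local server described under Local Development below (the experiment name, parameter, and metric are illustrative):

import mlflow

mlflow.set_tracking_uri("http://localhost:5000")
mlflow.set_experiment("marimo-flow-demo")  # illustrative experiment name

with mlflow.start_run():
    mlflow.log_param("learning_rate", 0.01)  # hyperparameters
    mlflow.log_metric("rmse", 0.42)          # evaluation metrics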

🚀 Production Ready

  • 🐳 Docker Deployment: One-command setup with docker-compose
  • 📦 Curated Snippets & Tutorials: 4 reusable snippet modules plus 15+ tutorial notebooks covering Polars, Plotly, Marimo UI patterns, RAG, and OpenVINO
  • 📚 Comprehensive Docs: Built-in reference guides with 100+ code examples
  • 🌐 GitHub Pages: Auto-deploy interactive notebooks with WASM

Quick Start 🏃‍♂️

With Docker (Recommended)

# Clone repository
git clone https://github.com/bjoernbethge/marimo-flow.git
cd marimo-flow

# Build and start services
docker compose -f docker/docker-compose.yaml up --build -d

# Access services
# Marimo: http://localhost:2718
# MLflow: http://localhost:5000

# View logs
docker compose -f docker/docker-compose.yaml logs -f

# Stop services
docker compose -f docker/docker-compose.yaml down

Local Development

# Install dependencies
uv sync

# Start MLflow server (in background or separate terminal)
uv run mlflow server \
  --host 0.0.0.0 \
  --port 5000 \
  --backend-store-uri sqlite:///data/experiments/db/mlflow.db \
  --default-artifact-root ./data/experiments/artifacts \
  --serve-artifacts

# Start Marimo (in another terminal)
uv run marimo edit examples/

Example Notebooks 📚

All notebooks live in examples/ and can be opened with uv run marimo edit examples/<file>.py.

  • 01_interactive_data_profiler.py – DuckDB-powered data explorer with filters, previews, and interactive scatter plots for any local database.
  • 02_mlflow_experiment_console.py – Connect to an MLflow tracking directory, inspect experiments, and visualize metric trends inline with Altair.
  • 03_pina_walrus_solver.py – Toggle between baseline PINNs and the Walrus adapter to solve a Poisson equation with live training controls.
  • 04_hyperparameter_tuning.py – Optuna-based hyperparameter search for PINA/PyTorch models with MLflow tracking and interactive study settings.
  • 05_model_registry.py – Train, register, and promote MLflow models end-to-end, including stage transitions and inference checks.
  • 06_production_pipeline.py – Production-style pipeline featuring validation gates, training, registry integration, deployment steps, and monitoring hooks.
  • 09_pina_live_monitoring.py – Live training monitoring with real-time loss plotting, error analysis, and comprehensive visualization tools.

Additional learning material lives in examples/tutorials/ (15+ focused notebooks covering marimo UI patterns, Polars, Plotly, DuckDB, OpenVINO, RAG, and PYG) plus examples/tutorials/legacy/ for the retired 00–03 pipeline.

Project Structure 📁

marimo-flow/
├── examples/                    # Production-ready marimo notebooks
│   ├── 01_interactive_data_profiler.py
│   ├── 02_mlflow_experiment_console.py
│   ├── 03_pina_walrus_solver.py
│   ├── 04_hyperparameter_tuning.py
│   ├── 05_model_registry.py
│   ├── 06_production_pipeline.py
│   ├── 09_pina_live_monitoring.py
│   └── tutorials/               # 15+ focused learning notebooks (+ legacy/)
├── snippets/                    # Reusable Python modules for notebooks
│   ├── __init__.py
│   ├── altair_visualization.py
│   ├── data_explorer_pattern.py
│   └── pina_basics.py
├── tools/                       # Utility tools
│   ├── ollama_manager.py        # Local LLM orchestration
│   └── openvino_manager.py      # Model serving utilities
├── docs/                        # Reference documentation
│   ├── marimo-quickstart.md     # Marimo guide
│   ├── polars-quickstart.md     # Polars guide
│   ├── plotly-quickstart.md     # Plotly guide
│   ├── pina-quickstart.md       # PINA guide
│   └── integration-patterns.md  # Integration examples
├── data/
│   └── mlflow/                  # MLflow storage
│       ├── artifacts/           # Model artifacts
│       ├── db/                  # SQLite database
│       └── prompts/             # Prompt templates
├── docker/                      # Docker configuration
├── pyproject.toml               # Dependencies
└── README.md                    # This file

πŸ“ About Snippets

The snippets/ directory contains reusable code patterns built for direct import into Marimo notebooks:

  • altair_visualization.py: opinionated chart builders and theming helpers
  • data_explorer_pattern.py: column filtering + scatter plotting utilities
  • pina_basics.py: Walrus/PINA helpers (problem setup, solver, visualization)

All examples already import these where needed; use them to jump-start your own notebooks or extend the shipped apps. Additional pattern walk-throughs live in examples/tutorials/.
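
A minimal import sketch, assuming notebooks run from the repository root so that snippets/ is on the import path:

# Inside a marimo cell
from snippets import altair_visualization, data_explorer_pattern, pina_basics

# Call their helpers from here; see each module (or the examples/ notebooks
# that already use them) for the exact function names.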

πŸ› οΈ About Tools

The tools/ directory contains standalone utility scripts for managing external services:

  • ollama_manager.py: Manage local LLM deployments with Ollama
  • openvino_manager.py: Model serving and inference with OpenVINO

📚 About References

The docs/ directory contains comprehensive LLM-friendly documentation for key technologies:

  • Quick-start guides for Marimo, Polars, Plotly, and PINA
  • Integration patterns and best practices
  • Code examples and common workflows

MCP (Model Context Protocol) Integration 🔌

Marimo Flow is AI-first with built-in Model Context Protocol (MCP) support for intelligent, context-aware development assistance.

Why MCP Matters

Traditional notebooks require constant context-switching to documentation sites. With MCP:

  • 📚 Live Documentation: Access up-to-date library docs directly in marimo
  • 🤖 AI Code Completion: Context-aware suggestions from local LLMs (Ollama)
  • 💡 Smart Assistance: Ask questions about libraries and get instant, accurate answers
  • 🔄 Always Current: Documentation updates automatically, no more outdated tutorials

Pre-Configured MCP Servers

Context7 - Universal Library Documentation

Access real-time documentation for any Python library:

# Ask: "How do I use polars window functions?"
# Get: Current polars docs, code examples, best practices

# Ask: "Show me plotly 3D scatter plot examples"
# Get: Latest plotly API with working code samples

Supported Libraries:

  • Polars, Pandas, NumPy - Data manipulation
  • Plotly, Altair, Matplotlib - Visualization
  • Scikit-learn, PyTorch - Machine Learning
  • And 1000+ more Python packages

Marimo - Specialized Notebook Assistance

Get expert help with marimo-specific patterns:

# Ask: "How do I create a reactive form in marimo?"
# Get: marimo form patterns, state management examples

# Ask: "Show me marimo UI element examples"
# Get: Complete UI component reference with code
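
For instance, a rough sketch of one marimo form pattern, based on marimo's documented batch/form helpers (an assumption about the exact API; verify it via the live docs above):

import marimo as mo

# Bundle several UI elements into one form with a submit button
form = (
    mo.md("Name: {name}  Epochs: {epochs}")
    .batch(name=mo.ui.text(label="Name"), epochs=mo.ui.slider(1, 50, value=10))
    .form()
)
form

# In another cell, form.value is a dict of the submitted values and only
# updates when the form is submitted.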

Real-World Examples

Example 1: Learning New Libraries

# You're exploring polars window functions
# Type: "polars rolling mean example"
# MCP returns: Latest polars docs + working code
import polars as pl

# df is an existing Polars DataFrame with a "sales" column
df.with_columns(
    pl.col("sales").rolling_mean(window_size=7).alias("7d_avg")
)

Example 2: Debugging

# Stuck on a plotly error?
# Ask: "Why is my plotly 3D scatter not showing?"
# Get: Common issues, solutions, and corrected code

Example 3: Best Practices

# Want to optimize code?
# Ask: "Best way to aggregate in polars?"
# Get: Performance tips, lazy evaluation patterns
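
As a concrete illustration of the kind of answer you would get back (a generic polars sketch, not tied to this repository's data):

import polars as pl

df = pl.DataFrame({"city": ["A", "A", "B"], "sales": [10, 20, 5]})

# Lazy evaluation lets polars optimize the whole query before executing it
result = (
    df.lazy()
    .group_by("city")
    .agg(pl.col("sales").sum().alias("total_sales"))
    .collect()
)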

AI Features Powered by MCP

  • Code Completion: Context-aware suggestions as you type (Ollama local LLM)
  • Inline Documentation: Hover over functions for instant docs
  • Smart Refactoring: AI suggests improvements based on current libraries
  • Interactive Q&A: Chat with AI about your code using latest docs

Configuration

MCP servers are pre-configured in .marimo.toml:

[mcp]
presets = ["context7", "marimo"]

[ai.ollama]
model = "gpt-oss:20b-cloud"
base_url = "http://localhost:11434/v1"

If you're running inside Docker, the same mcp block lives in docker/.marimo.toml, so both local and containerized sessions pick up identical presets.

Adding Custom MCP Servers

You can extend functionality by adding custom MCP servers in .marimo.toml:

[mcp.mcpServers.your-custom-server]
command = "npx"
args = ["-y", "@your-org/your-mcp-server"]

MLflow Trace Server (Optional)

Expose MLflow trace operations to MCP-aware IDEs/assistants (e.g., Claude Desktop, Cursor) by running:

mlflow mcp run

Run the command from an environment where MLFLOW_TRACKING_URI (or MLFLOW_BACKEND_STORE_URI/MLFLOW_DEFAULT_ARTIFACT_ROOT) points at your experiments. The server stays up until interrupted and can be proxied alongside Marimo/MLflow so every tool shares the same MCP context.


Configuration ⚙️

Environment Variables

Docker setup (configured in docker/docker-compose.yaml):

  • MLFLOW_BACKEND_STORE_URI: sqlite:////app/data/experiments/db/mlflow.db
  • MLFLOW_DEFAULT_ARTIFACT_ROOT: /app/data/experiments/artifacts
  • MLFLOW_HOST: 0.0.0.0 (allows external access)
  • MLFLOW_PORT: 5000
  • OLLAMA_BASE_URL: http://host.docker.internal:11434 (requires Ollama on host)

Local development:

  • MLFLOW_TRACKING_URI: http://localhost:5000 (default)
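
In notebook code, the same default can be picked up explicitly (a small sketch, assuming the local server from the Quick Start is running):

import os
import mlflow

# Fall back to the local server when MLFLOW_TRACKING_URI is not set
mlflow.set_tracking_uri(os.environ.get("MLFLOW_TRACKING_URI", "http://localhost:5000"))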

Docker Services

The Docker container runs both services via docker/start.sh:

  • Marimo: Port 2718 - Interactive notebook environment
  • MLflow: Port 5000 - Experiment tracking UI

GPU Support: NVIDIA GPU support is enabled by default. Remove the deploy.resources section in docker-compose.yaml if running without GPU.

Pre-installed ML & Data Science Stack 📦

Machine Learning & Scientific Computing

  • scikit-learn ^1.5.2 - Machine learning library
  • NumPy ^2.1.3 - Numerical computing
  • pandas ^2.2.3 - Data manipulation and analysis
  • PyArrow ^18.0.0 - Columnar data processing
  • SciPy ^1.14.1 - Scientific computing
  • matplotlib ^3.9.2 - Plotting library

High-Performance Data Processing

  • Polars ^1.12.0 - Lightning-fast DataFrame library
  • DuckDB ^1.1.3 - In-process analytical database
  • Altair ^5.4.1 - Declarative statistical visualization
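
These three compose naturally. A tiny sketch of the pattern behind the data-profiler-style notebooks (illustrative query, not taken from the repository):

import duckdb
import polars as pl
import altair as alt

# Query with DuckDB, hand the result to Polars, then plot with Altair
pl_df = duckdb.sql("SELECT range AS x FROM range(10)").pl()
pl_df = pl_df.with_columns((pl.col("x") ** 2).alias("y"))
chart = alt.Chart(pl_df).mark_point().encode(x="x", y="y")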

AI & LLM Integration

  • OpenAI ^1.54.4 - GPT API integration
  • FastAPI ^0.115.4 - Modern web framework
  • Pydantic ^2.10.2 - Data validation

Database & Storage

  • SQLAlchemy ^2.0.36 - SQL toolkit and ORM
  • Alembic ^1.14.0 - Database migrations
  • SQLGlot ^25.30.2 - SQL parser and transpiler

Web & API

Development Tools

  • Black ^24.10.0 - Code formatter
  • Ruff ^0.7.4 - Fast Python linter
  • pytest ^8.3.3 - Testing framework
  • MyPy ^1.13.0 - Static type checker

API Endpoints 🔌

MLflow REST API

  • Experiments: GET /api/2.0/mlflow/experiments/list
  • Runs: GET /api/2.0/mlflow/runs/search
  • Models: GET /api/2.0/mlflow/registered-models/list
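
The same data is reachable from Python through the MLflow client, which is often more convenient than raw REST calls (a small sketch against the local tracking server):

from mlflow.tracking import MlflowClient

client = MlflowClient(tracking_uri="http://localhost:5000")
for exp in client.search_experiments():
    print(exp.experiment_id, exp.name)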

Marimo Server

  • Notebooks: GET / - File browser and editor
  • Apps: GET /run/<notebook> - Run notebook as web app

Contributing 🤝

We welcome contributions! Please see our Contributing Guidelines for details on:

  • Development setup and workflow
  • Code standards and style guide
  • Testing requirements
  • Pull request process

Quick Start for Contributors:

  1. Fork the repository
  2. Create a feature branch: git checkout -b feature-name
  3. Make your changes following the coding standards
  4. Test your changes: uv run pytest
  5. Submit a pull request

See CONTRIBUTING.md for comprehensive guidelines.

Changelog 📋

See CHANGELOG.md for a detailed version history and release notes.

Current Version: 0.1.3

License 📄

This project is licensed under the MIT License - see the LICENSE file for details.


Built with ❀️ using Marimo and MLflow
