6 minutes reading time – Jan. 29, 2026

Microsoft Fabric is an attractive choice for data teams looking to iterate quickly. But for those coming from a code-first world, the “DevOps gap” in Fabric is a serious frustration.

How do you enforce PEP 8, run unit tests and guard code consistency when your logic is locked inside notebooks?

This problem has existed for some time, but the fashionable answer these days is “just use an AI agent that solves it.”

Except that, between data privacy, international data residency and the risk of hallucinations, AI often adds mostly complexity – and here we don’t need it.

The good news: because Fabric notebooks now sync via Git as plain, interoperable Python files, we can build a robust, local-first CI/CD pipeline with Ruff, Pytest and the Python ast (Abstract Syntax Tree) module.

Without AI. Just craftsmanship.

The philosophy: why “boring” tooling wins

As Senior Data Engineers, we want a pipeline that is deterministic, fast and understandable. No magic, no surprises. The goal is a workflow that:

  • Enforces consistency
    No more discussions about line lengths or formatting.

  • Encourages modularity
    Code that is not in a function or class is hardly testable. This enforces better habits.

  • Improves readability
    Most of your time is spent reading code. Modular, linted code is simply cheaper to maintain.

  • Stays independent
    No API calls to external services. Only pure Python on your build agent.

By running this inside your own Git environment (Azure DevOps, GitHub, GitLab), you keep full control.
Compared to AI agents, “boring” tools also have distinct advantages:

  • No hallucinations – the linter follows PEP8, no guesswork.

  • No API fees – you use CPU cycles you are already paying for.

  • No governance nightmare – sensitive data stays within your trusted Git environment.

Part 1: linting the “magic” out of notebooks

Fabric notebooks are full of magic commands (%pip, %sql, %run). Useful for speed, disastrous for standard linters like Ruff – they crash immediately.

The solution: the preamble strategy

We hide magic commands temporarily, without forgetting where they were.
We do this by prefixing lines beginning with % with a unique comment marker, for example:

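A line such as (the package name is purely illustrative)

    %pip install great-expectations

temporarily becomes

    #||| %pip install great-expectations

which is just a comment, so Ruff can parse the file again.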

Why those three pipes (|||)?

When Ruff adds or deletes lines, we need a watertight way to get our magic commands back. With a unique preamble we can always restore them exactly after linting.

Implementation in Python:

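A minimal sketch of how this could look; the function names and the marker constant are our own choices, not a fixed API:

    from pathlib import Path

    MARKER = "#||| "

    def hide_magics(path: Path) -> None:
        """Prefix every magic line with the marker so the file parses as plain Python."""
        lines = path.read_text(encoding="utf-8").splitlines(keepends=True)
        hidden = [
            MARKER + line if line.lstrip().startswith("%") else line
            for line in lines
        ]
        path.write_text("".join(hidden), encoding="utf-8")

    def restore_magics(path: Path) -> None:
        """Strip the marker again, recovering the original magic commands exactly."""
        lines = path.read_text(encoding="utf-8").splitlines(keepends=True)
        restored = [
            line.replace(MARKER, "", 1) if MARKER in line else line
            for line in lines
        ]
        path.write_text("".join(restored), encoding="utf-8")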

About multi-line magics

IPython technically allows magic commands to span multiple lines with backslashes.

But honestly: if you do that, you build a maintenance problem. Keep your magics simple and your Python logic modular. Time wasted on unreadable code is more expensive than any compute.

The pipeline flow

  1. Transform – % becomes #||| %

  2. Lint & fix – ruff check --fix file.py

  3. Restore – #||| % becomes %

  4. Commit – if there are changes, the pipeline commits them back

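A sketch of what this could look like as a single pipeline step, assuming the two helpers above and that ruff and git are available on the build agent (paths, branch name and commit message are placeholders):

    import subprocess
    from pathlib import Path

    notebook_files = list(Path("notebooks").rglob("*.py"))    # adjust to your repo layout

    for file in notebook_files:
        hide_magics(file)                                      # 1. transform: % -> #||| %

    subprocess.run(["ruff", "check", "--fix", "notebooks"])    # 2. lint & fix

    for file in notebook_files:
        restore_magics(file)                                   # 3. restore: #||| % -> %

    # 4. commit only if Ruff actually changed something
    if subprocess.run(["git", "diff", "--quiet"]).returncode != 0:
        subprocess.run(["git", "checkout", "-b", "fix/linting"], check=True)
        subprocess.run(["git", "commit", "-am", "chore: apply Ruff fixes"], check=True)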

Sample output:

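The exact wording depends on your Ruff version and rule set; the file name and rule hit below are made up, but the shape of the report is typical:

    notebooks/nb_silver_orders.py:57:5: F841 Local variable `df_tmp` is assigned to but never used
    Found 2 errors (1 fixed, 1 remaining).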

Pro tip: don’t want a bot committing directly to your branch?
Have the pipeline create a separate branch (fix/linting-XYZ) and open a PR automatically. A human still clicks merge – secure AND scalable.

That way your pipeline doesn’t just whine about technical debt.
It fixes it.

Part 2: unit testing without the “Run All” misery

Notebook testing is tricky. An import often runs the entire notebook.
If that includes a heavy Spark job, you may wait 20 minutes for your test to fail – for one helper function.

The solution: the AST (Abstract Syntax Tree)

Instead of executing the file, we parse it with Python’s ast module.
We look specifically for:

  • functions
  • classes
  • imports

We detach those from the notebook and load them into the test scope, without triggering top-level code.

What does this yield?

  • The modularity incentive
    Only code in functions and classes is tested. Loose clutter at the top does not count.

  • No side-effects
    No if __name__ == "__main__" constructs or other hacks needed to prevent execution.

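A minimal sketch of such a loader; the function name is our own, and it assumes any magic lines in the synced file are already commented out (for example via the #||| preamble), so the file parses:

    import ast
    from pathlib import Path
    from types import ModuleType

    def load_notebook_defs(path: str, module_name: str = "notebook_under_test") -> ModuleType:
        """Execute only the imports, functions and classes of a synced notebook file."""
        tree = ast.parse(Path(path).read_text(encoding="utf-8"))

        # Keep only the testable building blocks; drop all other top-level statements.
        wanted = (ast.Import, ast.ImportFrom, ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)
        tree.body = [node for node in tree.body if isinstance(node, wanted)]

        module = ModuleType(module_name)
        exec(compile(tree, filename=path, mode="exec"), module.__dict__)
        return module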

Real Spark, local speed

No heavy mocks.
We start a single-node Spark session locally on the build agent (Azure DevOps or GitHub Actions).

Spark + Delta installs surprisingly quickly on modern agents.
You test real transformations and schema logic, without consuming Fabric capacity.

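A minimal conftest.py sketch, assuming pyspark and delta-spark are installed on the agent:

    # conftest.py
    import pytest
    from delta import configure_spark_with_delta_pip
    from pyspark.sql import SparkSession

    @pytest.fixture(scope="session")
    def spark():
        """Single-node Spark session with Delta support, shared across the whole test run."""
        builder = (
            SparkSession.builder.master("local[*]")
            .appName("fabric-notebook-tests")
            .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
            .config("spark.sql.catalog.spark_catalog", "org.apache.spark.sql.delta.catalog.DeltaCatalog")
        )
        session = configure_spark_with_delta_pip(builder).getOrCreate()
        yield session
        session.stop()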

Tests

With access to Spark and your notebook functions, you can write tests – even in notebook form, just as you are used to in Fabric.

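A hedged example of what such a test could look like; clean_orders, the notebook path and the loader module are illustrative placeholders for your own code:

    from pyspark.sql import Row

    from tests.ast_loader import load_notebook_defs   # the loader sketched in part 2

    def test_clean_orders_drops_negative_amounts(spark):
        # Load only the functions/classes/imports from the synced notebook file.
        nb = load_notebook_defs("notebooks/nb_silver_orders.py")

        df = spark.createDataFrame([Row(order_id=1, amount=10.0), Row(order_id=2, amount=-5.0)])
        result = nb.clean_orders(df)

        # The hypothetical clean_orders drops rows with a negative amount.
        assert result.count() == 1
        assert result.first().order_id == 1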

What you test, and how to do it maintainably, deserves a separate blog.

Conclusion

The fact that Fabric stores notebooks as transparent Python files is a gift to Data Engineers.
It makes it possible to apply proven software engineering principles without losing notebook speed.

With AST for selective testing and a simple preamble for magic commands, we bridge the gap between rapid prototyping and production quality.
The result: a consistent, maintainable codebase – without added complexity. And without hallucinations.

"*" indicates required fields

I am a
Name*

takeaways:

Want to spar about this approach or other data engineering topics?