Aviary is a lightweight, self-service data portal built entirely on the Databricks platform. Powered by Databricks Lakebase (managed PostgreSQL OLTP) for sub-second catalog queries, Databricks Apps for a polished front-end, and Databricks Genie for natural language data access, Aviary lets data consumers browse, filter, query, and export governed datasets with built-in access controls and transparent chargeback pricing, without writing a single line of SQL.
Key outcomes:
In large-scale organisations, data is the connective tissue between business units. Yet, for those tasked with turning that data into strategy, the reality is often a collection of expensive, fragmented silos.
Aviary was built on a simple premise: Data consumers don’t want a warehouse; they want answers. By focusing on the specific friction points that stall institutional intelligence, Aviary transforms data from a static liability into a self-service product.
The pain points it solves:
Aviary's architecture is deliberately minimal:

Every dataset in Aviary is registered in a single datasets_metadata table stored in Lakebase. Each row defines:
This means adding a new dataset to the marketplace takes a single INSERT statement into the datasets_metadata table. No code changes, no redeployment.
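As a rough illustration of that registration step, here is a minimal sketch that uses Python's built-in sqlite3 module as a stand-in for the Lakebase PostgreSQL database, so it runs anywhere. The column set is an assumption pieced together from fields mentioned in this article (name, description, sector, vendor, chargeback rate). The dataset_id column, the sample rows, and both helper functions are hypothetical and are not Aviary's actual schema or code.

```python
import sqlite3

# Assumption: in-memory SQLite stands in for the Lakebase (PostgreSQL) catalog.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE datasets_metadata (
        dataset_id      TEXT PRIMARY KEY,  -- hypothetical identifier column
        name            TEXT NOT NULL,
        description     TEXT NOT NULL,
        sector          TEXT NOT NULL,
        vendor          TEXT NOT NULL,
        chargeback_rate REAL NOT NULL      -- unit is an assumption (cost per row)
    )
    """
)


def register_dataset(conn, row):
    """Register one dataset: a single parameterised INSERT, nothing else."""
    conn.execute(
        "INSERT INTO datasets_metadata VALUES (?, ?, ?, ?, ?, ?)", row
    )
    conn.commit()


def dataset_counts_by_sector(conn):
    """Return {sector: number of registered datasets}."""
    cursor = conn.execute(
        "SELECT sector, COUNT(*) FROM datasets_metadata "
        "GROUP BY sector ORDER BY sector"
    )
    return dict(cursor.fetchall())


# Made-up sample rows, for illustration only.
register_dataset(
    conn,
    ("ds_001", "Grid load", "Hourly grid load", "Energy", "ExampleVendor", 0.002),
)
register_dataset(
    conn,
    ("ds_002", "Card spend", "Daily card spend", "Finance", "ExampleVendor", 0.01),
)

print(dataset_counts_by_sector(conn))
```

Because the catalog is just rows in a table, a front-end that reads from it picks up a new dataset on its next query, which is what makes the "no redeployment" claim plausible.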

Consumers land on a home page organised by sector (Energy, Finance, Healthcare, Transportation) and vendor. Each sector card shows a live dataset count. Clicking through reveals dataset cards with:
Licence-based access and approval workflows
Not all datasets are equal. Some carry third-party licensing terms, contain commercially sensitive information, or are governed by regulatory constraints.
Aviary classifies datasets into access tiers:

This approval workflow means organisations can onboard sensitive or licensed datasets into the marketplace without exposing them to unauthorised consumers. Once approved, access persists until explicitly revoked, providing a full audit trail of who requested access, when it was granted, and by whom.
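The request, approve, revoke lifecycle described above can be sketched in a few lines of Python. This is an illustrative in-memory model only: the AccessGrant and AccessRegistry names, their fields, and their methods are hypothetical, and a real deployment would presumably persist these records in the database rather than in a dictionary.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


def _now() -> datetime:
    return datetime.now(timezone.utc)


@dataclass
class AccessGrant:
    """One audit record: who asked, when, who approved, and when it ended."""
    dataset_id: str
    consumer: str
    requested_at: datetime
    approved_by: Optional[str] = None
    approved_at: Optional[datetime] = None
    revoked_at: Optional[datetime] = None

    @property
    def active(self) -> bool:
        # Access persists from approval until it is explicitly revoked.
        return self.approved_at is not None and self.revoked_at is None


class AccessRegistry:
    """Hypothetical in-memory store keyed by (dataset_id, consumer)."""

    def __init__(self):
        self._grants = {}

    def request(self, dataset_id: str, consumer: str) -> AccessGrant:
        grant = AccessGrant(dataset_id, consumer, requested_at=_now())
        self._grants[(dataset_id, consumer)] = grant
        return grant

    def approve(self, dataset_id: str, consumer: str, approver: str) -> None:
        grant = self._grants[(dataset_id, consumer)]
        grant.approved_by = approver
        grant.approved_at = _now()

    def revoke(self, dataset_id: str, consumer: str) -> None:
        self._grants[(dataset_id, consumer)].revoked_at = _now()

    def has_access(self, dataset_id: str, consumer: str) -> bool:
        grant = self._grants.get((dataset_id, consumer))
        return grant is not None and grant.active


registry = AccessRegistry()
registry.request("ds_002", "alice")
registry.approve("ds_002", "alice", approver="data_owner")
print(registry.has_access("ds_002", "alice"))
```

Keeping the timestamps and approver on the same record as the grant is one simple way to get the audit trail for free: the record that authorises access is also the record of how it was authorised.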

When a consumer clicks "Export," Aviary calculates the estimated cost based on the matched row count and the dataset's chargeback rate. A confirmation dialog shows:
This makes data consumption costs visible at the point of decision, not buried in a monthly invoice.
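The estimate itself is simple arithmetic. The sketch below assumes the chargeback rate is expressed as a cost per row and that the result is rounded to two decimal places; the article does not state the rate's unit or the rounding rule, so both are assumptions, and estimate_export_cost is a hypothetical name.

```python
from decimal import Decimal, ROUND_HALF_UP


def estimate_export_cost(matched_rows: int, rate_per_row: Decimal) -> Decimal:
    """Estimated cost = matched row count x per-row chargeback rate."""
    if matched_rows < 0:
        raise ValueError("matched_rows must be non-negative")
    cost = Decimal(matched_rows) * rate_per_row
    # Round to two decimal places for display in a confirmation dialog.
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# Example: 12,500 matched rows at an assumed rate of 0.002 per row.
print(estimate_export_cost(12_500, Decimal("0.002")))  # 25.00
```

Decimal is used instead of float so that the figure shown to the consumer matches what would later be billed, without binary floating-point rounding surprises.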

Restricted datasets show only metadata (name, description, sector). No data preview, no export. Consumers can submit an access request directly from the card, triggering notification to dataset owners.
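One way to picture this rule is as a function that decides what a dataset card is allowed to carry. The sketch below is hypothetical: card_view, the flag names, and the dictionary shape are illustrative and are not taken from Aviary's code. Only the three metadata fields come from the description above.

```python
# Fields a restricted card may show, per the description above.
METADATA_FIELDS = ("name", "description", "sector")


def card_view(dataset: dict, has_access: bool) -> dict:
    """Return the fields and actions a consumer may see for one dataset."""
    if has_access:
        return {**dataset, "can_preview": True, "can_export": True}
    # Without access: metadata only, plus the option to request access.
    view = {key: dataset[key] for key in METADATA_FIELDS}
    view.update(can_preview=False, can_export=False, can_request_access=True)
    return view


dataset = {
    "name": "Card spend",            # made-up sample values
    "description": "Daily card spend",
    "sector": "Finance",
    "vendor": "ExampleVendor",
    "chargeback_rate": 0.01,
}
print(card_view(dataset, has_access=False))
```

Building the restricted view from an allow-list of fields, rather than deleting sensitive ones, means any column added to the catalog later stays hidden by default.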

Why Lakebase instead of querying Delta tables directly?
Why embed Databricks Genie into a data marketplace?
Aviary's filter-based browsing works well when consumers know what they're looking for. But the most common question in any data team isn't "show me column X filtered by Y." It's: "Do we have data that can answer this question?"
Genie bridges that gap by letting consumers query datasets in plain English, without needing to understand table schemas, filter configurations, or SQL syntax.

Genie doesn't replace the structured browsing experience. It complements it. Power users who know exactly which dataset they need can go straight to it via sector/vendor navigation. Everyone else can start with a question and let Genie guide them to the right data.

Aviary demonstrates that a production-grade data portal doesn't require a massive engineering effort. By leveraging Databricks for the data and metadata layer, Databricks Apps for the front-end, and integrated identity management (OAuth) for seamless, password-less authentication, we created a governed, chargeback-aware ecosystem.

The true value of this solution lies in how it changes the daily lives of the people using it:
Ultimately, Aviary solves the business case for data democratisation. It ensures that data is no longer a fragmented technical liability locked in silos, but a high-velocity asset that is easy to find, safe to use, and transparently priced.