Create Models

In Rill, data models are built using SQL SELECT statements applied to your source data. They allow you to join, transform, and clean data.

Avoid Pre-aggregated Metrics

Rill works best for slicing and dicing data meaning keeping data closer to raw to retain that granularity for flexible analysis. When loading data, be careful with adding pre-aggregated metrics like averages as that could lead to unintended results like a sum of an average. Instead, load the two raw metrics and calculate the derived metric in your model or dashboard.

SQL transformations

By default, data transformations in Rill Developer are powered by DuckDB and its dialect of SQL (DuckDB SQL). Please visit the DuckDB SQL documentation to learn how to write your queries.

It is possible to change the default OLAP engine for the entire project or a specific metrics view. You will need to define the connector credentials within your Rill project or via environment variables.

For additional tips on commonly used expressions (either in models or dashboard definitions), visit our common expressions page.

Adding a data model

Using the UI

In the UI, add a new data model by clicking the '+' icon next to 'Models' in the left-hand navigation pane. You can now begin typing a DuckDB SQL SELECT query for your model in the code editor – with real-time feedback.

Using code

When you add a data model using the UI, a code definition will automatically be created as a <model_name>.sql file in the models folder of your Rill project.

You can also create a model outside of the application and add it to Rill by placing a <model_name>.sql file in the models directory containing a DuckDB SQL SELECT statement. Rill will automatically detect and parse the model the next time you run rill start.

tip

How to use data models

One Big Table and dashboarding

It is powerful to be able to translate many ad hoc questions into a data framework that can answer a class of questions at scale. For example, high-level insights that are relevant to a company (how much revenue did we make last week?) are more actionable for an employee if it is relevant for their role (how did my campaign increase revenue last week?).

To experience the full potential of Rill, model your data sources into "One Big Table" – a granular resource that contains as much information as possible and can be rolled up in a meaningful way. This flexible OBT can be combined with a generalizable metrics definition to enable ad hoc slice-and-dice discovery through Rill's interactive dashboard.

Intermediate processing

Models can also be cross-referenced with each other to produce the final output for your dashboard. The advantage here is that more complex, intermediate data transformations can be utilized to achieve your final source for the dashboard. Example ideas for modeling:

Lookups for id/name joins
Unnesting and merging complex data types
Combining multiple sources with data cleansing or transformation requirements

Working with Pivots

Pivots deserve their own section, as using the Pivot statement while modeling requires special consideration. Notably, there are a few existing DuckDB limitations to consider:

DuckDB's SQL to JSON serializer doesn't support PIVOT without the IN filter
DuckDB doesn't support creating views based on PIVOT without an IN filter (and all models are materialized as views by default in Rill)

Fortunately, there are a few workarounds that we can leverage to circumvent these limitations.

Passing the `IN` filter with your `PIVOT` statement

If you know the exact values that you are trying to pivot on, you can simply pass in these values as part of your pivot query by using an IN filter with your ON clause (link to DuckDB documentation). For example, rather than:

PIVOT table_name ON column_name USING SUM(measure)

You can use the following PIVOT statement:

PIVOT table_name ON column_name IN (value_a, value_b, value_c) USING SUM(measure)

Materializing the model as a table and then utilizing the `ref` function

If you need to use the PIVOT statement specifically but don't want to specify an IN filter, then you will need to inform Rill to materialize this model as a table and leverage the ref function as well (for proper DAG resolution). Using the same example, this would instead look something like:

-- @materialize: true

PIVOT {{ ref "table_name" }} ON column_name USING SUM(measure)

A note on model materialization

The -- @materialize: true at the top of your model SQL file informs Rill to materialize the model as a table in the underlying OLAP engine (instead of the default view). More details about materializing models can be found on our model reference page.

SQL transformations​

Adding a data model​

Using the UI​

Using code​

How to use data models​

One Big Table and dashboarding​

Intermediate processing​

Working with Pivots​

Passing the IN filter with your PIVOT statement​

Materializing the model as a table and then utilizing the ref function​