Incremental Models
Before enabling incremental on the model, let's take a look at the following model YAML file, now.yaml.
Sample YAML
# This model outputs the current time every time it is refreshed.
type: model
refresh:
cron: 0 0 * * *
sql: SELECT now() AS inserted_on
To understand what this is doing, let's go ahead and select the Refresh button. This button performs the same command as the below in the CLI.
rill project refresh --model now --local
Since we're using Rill Developer, we will need to add the --local
flag to the refresh commands or else this will refresh the project on Rill Cloud!
After the model refreshes, you should see the inserted_on value change.
update this next week
Enable Increments on our Model
As mentioned previously, the incremental: true
tells Rill that this model is incremental. You will see that the UI changes slightly when this is enabled. Not only will you be able to full refresh, but also incrementally refresh.
type: model
refresh:
cron: 0 0 * * *
sql: SELECT now() AS inserted_on
incremental: true
After adding the following, let's run the same command. What's different?
update this
Instead of overwriting the same row, we are now appending the new values of now() into the table. With the refresh enabled to run at midnight on every night, you should see the amount of rows increase each day at midnight UTC. Next, let's take a moment to review states:
.
States in Models
Lastly, we can add a state:
key that allows us to manually define some sort of state to use as our incrementing key.
#filename: incremental_state.yaml
type: model
refresh:
cron: 0 0 * * *
sql: SELECT {{ if incremental }} {{ .state.max_val }} + 1 {{ else }} 0 {{ end}} AS val, now() AS inserted_on
state:
sql: SELECT MAX(val) as max_val FROM incremental_state
incremental: true
In more realistic cases, we could select the MAX(time_stamp) which will grab the latest time_stamp that your current model includes. Then, based on this it would incrementally refresh your model to only insert the new data. This would continue to occur. However, keep in mind that any 'old' data that gets added outside of Rill would not be detected.
Along with the inserted_on column, we are also creating a val column that defaults to 0. Then on each run, if incremental, increases this value. The state retrieves the max value of 'val' as max_val.
SELECT
{{ if incremental }}
{{ .state.max_val }} + 1 {{ else }} 0
{{ end}}
Was this content helpful?