rythm: fix ingestion slack time range #4459

javiermolinar · 2024-12-16T16:20:31Z

What this PR does:
It fixes the ingestion slack by calculating it according to the partition's last commit.

Using a fixed time set at WAL creation does not work as intended, for several reasons. Every partition can be at a different offset, and the WAL is created before reading from Kafka. Also, we have no way to know this start time in the case of a WAL replay.

The adjustment of the ingestion slack should be done outside the WAL, since we could have different strategies. Another possibility could be storing a function to calculate it but I have opted for this approach that extends the current interface without modifying it.
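A minimal sketch of the idea described above, not the code in this PR: the acceptable time range is derived per partition from its last commit time and the end of the current consumption cycle, and the adjustment happens outside the WAL. The names `slackWindow`, `newSlackWindow`, `adjust`, and `clamp` are hypothetical, and clamping out-of-range values to the nearest bound is an assumption made for illustration.

```go
package main

import (
	"fmt"
	"time"
)

// slackWindow is a hypothetical type: the range of acceptable span
// timestamps (unix seconds) for one partition's consumption cycle.
type slackWindow struct {
	lo, hi uint32
}

// newSlackWindow opens the window from the partition's last commit time
// minus the slack up to the end of the current cycle plus the slack.
func newSlackWindow(lastCommit, cycleEnd time.Time, slack time.Duration) slackWindow {
	return slackWindow{
		lo: uint32(lastCommit.Add(-slack).Unix()),
		hi: uint32(cycleEnd.Add(slack).Unix()),
	}
}

// adjust clamps start and end into the window, so that a single span with
// a bogus timestamp cannot stretch the block's time range.
func (w slackWindow) adjust(start, end uint32) (uint32, uint32) {
	return clamp(start, w.lo, w.hi), clamp(end, w.lo, w.hi)
}

// clamp returns v limited to the inclusive range [lo, hi].
func clamp(v, lo, hi uint32) uint32 {
	if v < lo {
		return lo
	}
	if v > hi {
		return hi
	}
	return v
}

func main() {
	lastCommit := time.Unix(1_700_000_000, 0)
	cycleEnd := lastCommit.Add(5 * time.Minute)
	w := newSlackWindow(lastCommit, cycleEnd, 2*time.Minute)

	// A span reporting start time 0 is pulled up to the window's lower bound;
	// the in-range end time is left untouched.
	start, end := w.adjust(0, 1_700_000_100)
	fmt.Println(start, end)
}
```

Because the caller computes the adjusted values before appending, the WAL does not need to know which strategy produced them.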

modules/blockbuilder/partition_writer.go

modules/blockbuilder/tenant_store.go

joe-elliott · 2024-12-17T13:19:32Z

I have not dug into the details of this PR, but I want to share some history about ingestion slack. When we first rolled Tempo out internally, it would mark a block's start and end time using the min start time and max end time of all spans in the block. We quickly found that every block had a start time of 0 and an end time 100 years in the future due to the data we were consuming. So I created the ingestion slack to prevent this.

Ingestion slack is gross to calculate and I have since just wished I had added 5 minutes to the beginning and end of every block instead of trying to watch all spans and only doing it conditionally.

You all are welcome to tackle this problem however you want, and I'd be glad to discuss it synchronously if you'd like to, but I just wanted to give some history.
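To illustrate the history above, here is a small sketch of why a plain min/max over span timestamps breaks down, next to one possible reading of the "add 5 minutes" alternative. The `span` type and the functions `naiveRange` and `paddedRange` are hypothetical and are not taken from Tempo; in particular, padding the period during which the block was open is an assumption about what that alternative would look like.

```go
package main

import "fmt"

// span carries hypothetical start/end timestamps in unix seconds.
type span struct{ start, end uint32 }

// naiveRange returns the minimum start and maximum end over all spans.
// One span with a bogus timestamp is enough to distort the result.
func naiveRange(spans []span) (uint32, uint32) {
	if len(spans) == 0 {
		return 0, 0
	}
	lo, hi := spans[0].start, spans[0].end
	for _, s := range spans[1:] {
		if s.start < lo {
			lo = s.start
		}
		if s.end > hi {
			hi = s.end
		}
	}
	return lo, hi
}

// paddedRange ignores span timestamps entirely and pads the period during
// which the block was open by a fixed amount on both sides.
func paddedRange(opened, closed, pad uint32) (uint32, uint32) {
	if opened < pad {
		// Avoid unsigned underflow for very small timestamps.
		return 0, closed + pad
	}
	return opened - pad, closed + pad
}

func main() {
	spans := []span{
		{start: 1_700_000_010, end: 1_700_000_020},
		{start: 0, end: 1_700_000_030},             // missing start time
		{start: 1_700_000_040, end: 4_000_000_000}, // end time decades in the future
	}
	fmt.Println(naiveRange(spans))

	// Block open for ten minutes, padded by five minutes on each side.
	fmt.Println(paddedRange(1_700_000_000, 1_700_000_600, 300))
}
```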

mapno

I like the new behaviour, but I'm unsure about changing how Append and AppendTrace work like that, especially when changing it for only one Parquet version. There is no warning to the callers of the functions.

tempodb/encoding/vparquet4/wal_block.go

modules/blockbuilder/blockbuilder.go

rythm: fix ingestion slack time range

db9fb73

javiermolinar requested review from joe-elliott, mdisibio, mapno, yvrhdn, zalegrala, electron0zero, ie-pham and stoewer as code owners December 16, 2024 16:20

mapno reviewed Dec 17, 2024

modules/blockbuilder/partition_writer.go (outdated)

modules/blockbuilder/tenant_store.go (outdated)

javiermolinar added 4 commits December 19, 2024 11:04

Merge branch 'main-rhythm' into rhythm-fix-ingestion-time-range

dd159d7

fix linting

e2b0654

fix startTime for partition writer

fe8253b

open slack ingestion by using the end cycle time

5b73700

javiermolinar requested a review from mapno December 19, 2024 14:01

javiermolinar added 3 commits December 19, 2024 16:15

Merge branch 'main-rhythm' into rhythm-fix-ingestion-time-range

703a1e6

fix test

a8bd98a

fix lint

f3d66cb

mapno reviewed Dec 20, 2024

tempodb/encoding/vparquet4/wal_block.go

modules/blockbuilder/blockbuilder.go (outdated)

propagate changes to old parquet encondings

e5c6c05
