feat: create testruns model in timeseries app #508

joseph-sentry · 2025-02-10T21:25:45Z

Creates the Testrun and TestrunSummary models for TA. The migrations first create a regular table and turn it into a hypertable, then create the continuous aggregates, then set the continuous aggregate (cagg) policies, and finally create the Testrun and TestrunBranchSummary models.
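
For readers unfamiliar with Timescale, the ordering described above can be sketched roughly as follows. This is only an illustration: the table name, view name, the `run_count` aggregate, and the `schedule_interval` value are assumptions, not what the PR's migrations contain. Each statement would typically be handed to a Django `migrations.RunSQL` operation.

```python
# Hypothetical names: the table, view, and column names below are assumptions
# for illustration, not the ones used in this PR's migrations.
TABLE = "timeseries_testrun"
VIEW = "timeseries_testrun_summary_1day"


def testrun_migration_steps(table: str = TABLE, view: str = VIEW) -> list:
    """Return the SQL statements in the order the description lists them."""
    return [
        # 1. Turn the regular table (already created by Django) into a hypertable.
        f"SELECT create_hypertable('{table}', 'timestamp');",
        # 2. Create a continuous aggregate on top of the hypertable.
        f"""
        CREATE MATERIALIZED VIEW {view}
        WITH (timescaledb.continuous) AS
        SELECT
            repo_id,
            time_bucket(interval '1 days', timestamp) AS timestamp_bin,
            COUNT(*) AS run_count
        FROM {table}
        GROUP BY repo_id, timestamp_bin;
        """,
        # 3. Attach a refresh policy to the continuous aggregate.
        #    The schedule_interval value is an assumption.
        f"""
        SELECT add_continuous_aggregate_policy(
            '{view}',
            start_offset => INTERVAL '7 days',
            end_offset => INTERVAL '1 days',
            schedule_interval => INTERVAL '1 hour'
        );
        """,
    ]


steps = testrun_migration_steps()
```

The Django models for the aggregates come last because they only map onto views that the earlier steps have already created.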

codecov · 2025-02-11T16:45:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 89.98%. Comparing base (3aea532) to head (4bef0cc).

❗ Current head 4bef0cc differs from pull request most recent head aadbae1

Please upload reports for the commit aadbae1 to get more accurate results.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #508      +/-   ##
==========================================
- Coverage   90.46%   89.98%   -0.49%     
==========================================
  Files         463      324     -139     
  Lines       13264     9044    -4220     
  Branches     2116     1599     -517     
==========================================
- Hits        11999     8138    -3861     
+ Misses       1140      845     -295     
+ Partials      125       61      -64

Flag	Coverage Δ
shared-docker-uploader	`?`

Flags with carried forward coverage won't be shown.


nora-codecov

just one q - do you want to do RiskyRunSQL? I'm not sure if we do that for timeseries things.

Swatinem

lgtm.

it would be nice to give the migrations a readable name, as well as rename the aggregate field cwf to failing_commits to make it more obvious.

otherwise, I would maybe add another index on repo_id, branch for the main testruns table.
The experiments I did last week using clickhouse made it clear that pre-aggregating on the high-cardinality branch column (lots of feature branches, but very few runs for each) does not make much sense, and it's possible to use the raw data table to query per test, or to aggregate per feature branch.
The pre-aggregation / materialized view makes sense for the main branch and "across all branches" though.
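
A rough sketch of what querying the raw table per feature branch could look like, with an index on (repo_id, branch). SQLite stands in for Postgres/Timescale so the snippet runs anywhere; the table, column, and index names are assumptions, not the PR's schema.

```python
import sqlite3

# SQLite stands in for Postgres/Timescale here; the table and column names are
# assumptions for illustration, not the PR's actual schema.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE testrun (repo_id INTEGER, branch TEXT, test_id TEXT, outcome TEXT)"
)
conn.execute("CREATE INDEX testrun_repo_branch ON testrun (repo_id, branch)")
conn.executemany(
    "INSERT INTO testrun VALUES (?, ?, ?, ?)",
    [
        (1, "main", "t1", "pass"),
        (1, "feature/a", "t1", "failure"),
        (1, "feature/a", "t2", "pass"),
        (2, "feature/a", "t1", "failure"),
    ],
)

PER_BRANCH_QUERY = """
    SELECT test_id, COUNT(*) AS runs, SUM(outcome = 'failure') AS failures
    FROM testrun
    WHERE repo_id = ? AND branch = ?
    GROUP BY test_id
    ORDER BY test_id
"""

# Aggregate per test straight from the raw rows of one feature branch.
per_test = conn.execute(PER_BRANCH_QUERY, (1, "feature/a")).fetchall()

# Ask the planner whether the (repo_id, branch) index serves the filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN " + PER_BRANCH_QUERY, (1, "feature/a")
).fetchall()
plan_text = " ".join(row[-1] for row in plan)
```

With few runs per feature branch, the index narrows the scan to a handful of rows, so aggregating on the fly stays cheap without a materialized view per branch.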

Swatinem · 2025-02-12T10:18:22Z

shared/django_apps/timeseries/migrations/0016_auto_20250206_1513.py

+                    COUNT(DISTINCT CASE WHEN outcome = 'failure' OR outcome = 'flaky_fail' THEN commit_sha ELSE NULL END) AS cwf,
+                    time_bucket(interval '1 days', timestamp) as timestamp_bin,


Suggested change

-                    COUNT(DISTINCT CASE WHEN outcome = 'failure' OR outcome = 'flaky_fail' THEN commit_sha ELSE NULL END) AS cwf,
-                    time_bucket(interval '1 days', timestamp) as timestamp_bin,
+                    time_bucket(interval '1 days', timestamp) as timestamp_bin,
+                    COUNT(DISTINCT CASE WHEN outcome = 'failure' OR outcome = 'flaky_fail' THEN commit_sha ELSE NULL END) AS failing_commits,

please spell out failing_commits instead of the cryptic cwf.

also IMO it's a bit more readable to visually group the "group by" columns and keep them separate from the aggregates.
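
To make the aggregate concrete, here is a small runnable sketch of the same expression with the "group by" columns listed first. SQLite is used as a stand-in, so `date(timestamp)` replaces Timescale's `time_bucket(interval '1 days', timestamp)`; the table layout and sample rows are made up.

```python
import sqlite3

# SQLite stand-in: date(timestamp) plays the role of Timescale's
# time_bucket(interval '1 days', timestamp). Table and sample rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE testrun (timestamp TEXT, repo_id INTEGER, commit_sha TEXT, outcome TEXT)"
)
conn.executemany(
    "INSERT INTO testrun VALUES (?, ?, ?, ?)",
    [
        ("2025-02-06 10:00:00", 1, "aaa", "failure"),
        ("2025-02-06 11:00:00", 1, "aaa", "flaky_fail"),  # same commit, counted once
        ("2025-02-06 12:00:00", 1, "bbb", "pass"),        # passing run, not counted
        ("2025-02-06 13:00:00", 1, "ddd", "failure"),
        ("2025-02-07 09:00:00", 1, "ccc", "failure"),
    ],
)

rows = conn.execute(
    """
    SELECT
        -- "group by" columns first
        repo_id,
        date(timestamp) AS timestamp_bin,
        -- aggregates second
        COUNT(DISTINCT CASE WHEN outcome = 'failure' OR outcome = 'flaky_fail'
                            THEN commit_sha ELSE NULL END) AS failing_commits
    FROM testrun
    GROUP BY repo_id, timestamp_bin
    ORDER BY timestamp_bin
    """
).fetchall()
```

The `CASE` yields NULL for passing runs, and `COUNT(DISTINCT ...)` skips NULLs, so each failing commit is counted once per day no matter how many of its runs failed.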

Swatinem · 2025-02-12T10:19:39Z

shared/django_apps/timeseries/migrations/0019_auto_20250206_1657.py

+
+class Migration(migrations.Migration):
+    dependencies = [
+        ("timeseries", "0018_auto_20250206_1657"),


can you give these migrations a readable name?
also, is it possible to merge some of those together, or do they have to be separate migrations?

Swatinem · 2025-02-12T10:20:27Z

shared/django_apps/timeseries/migrations/0019_auto_20250206_1657.py

+                start_offset => '7 days',
+                end_offset => '1 days',


is there some documentation of what these mean?
I thought we want to aggregate things for 60 days. what does the 7 days here mean in relation to that?

there are docs for this. we do want to aggregate for 60 days, but we won't be refreshing aggregated rows older than 7 days, so if we somehow process a seven-day-old upload again, the continuous aggregate won't be refreshed with that data.

I think we can tune this policy as we see fit going forward, this is kinda just a placeholder, and I don't think the start_offset even needs to be that far back.
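
As a simplified model of how the two offsets interact (not Timescale's actual implementation, and ignoring bucket alignment), each policy run can be thought of as refreshing the window between `now - start_offset` and `now - end_offset`. The function names below are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Simplified model of a continuous aggregate refresh policy: each run refreshes
# the window [now - start_offset, now - end_offset). Bucket alignment and
# Timescale's other policy details are deliberately ignored.
START_OFFSET = timedelta(days=7)
END_OFFSET = timedelta(days=1)


def refresh_window(now, start_offset=START_OFFSET, end_offset=END_OFFSET):
    """Return the (start, end) of the window a policy run at `now` would cover."""
    return now - start_offset, now - end_offset


def is_refreshed(row_time, now):
    """True if a row with this timestamp falls inside the current refresh window."""
    start, end = refresh_window(now)
    return start <= row_time < end


now = datetime(2025, 2, 12, tzinfo=timezone.utc)
recent_upload = now - timedelta(days=3)   # inside the window, picked up by the policy
old_upload = now - timedelta(days=10)     # older than start_offset, not refreshed
fresh_upload = now - timedelta(hours=2)   # newer than end_offset, waits for a later run
```

Under this model the 60 days describes how much aggregated history is kept around, while the 7 days only bounds how far back a late-arriving row can still change an existing aggregate.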

Swatinem · 2025-02-12T10:23:44Z

shared/django_apps/timeseries/models.py

+            models.Index(
+                fields=["repo_id", "test_id", "flags_hash"],
+            ),
+        ]
+        constraints = [
+            models.UniqueConstraint(
+                fields=["repo_id", "test_id", "flags_hash"],
+                name="flags_hash_test_id_unique",
+            ),


I believe a unique constraint already implicitly creates an index that can be used for queries, which means this index would be a duplicate.

on the other hand, I don’t see any indices being defined on the materialized views. Do we not need those?

Timescale automatically creates an index on the GROUP BY columns of continuous aggregates: https://docs.timescale.com/use-timescale/latest/continuous-aggregates/create-index/#automatically-created-indexes

i'm going to remove the unique constraint for now
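
The duplication concern can be illustrated with a small sketch. SQLite is used here only as a runnable stand-in (Postgres likewise backs a unique constraint with a unique index); the table name and the `explicit_idx` index name are hypothetical.

```python
import sqlite3

# SQLite stand-in for Postgres; the table name and explicit_idx are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE testrun_flags (
        repo_id INTEGER,
        test_id BLOB,
        flags_hash BLOB,
        CONSTRAINT flags_hash_test_id_unique UNIQUE (repo_id, test_id, flags_hash)
    )
    """
)
# An explicit index on the same columns, like the models.Index in the diff.
conn.execute(
    "CREATE INDEX explicit_idx ON testrun_flags (repo_id, test_id, flags_hash)"
)

# PRAGMA index_list rows start with (seq, name, unique, ...).
indexes = {
    row[1]: bool(row[2])
    for row in conn.execute("PRAGMA index_list('testrun_flags')")
}
```

The table ends up with two indexes over the same three columns: the explicit one and the one created automatically for the unique constraint, so keeping both only adds write overhead.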

Swatinem · 2025-02-12T10:25:17Z

shared/django_apps/timeseries/models.py

+    test_id = models.BinaryField(null=False)
+    flags_hash = models.BinaryField(null=True)


is the flags_hash not part of the test_id?
we also store all the flags inline as the flags array (which IMO is a better idea than having a join table)

do we need the flags_hash at all in that case then?

is the flags_hash not part of the test_id?

not anymore, i guess we can remove it since we're storing the flags in the table and we aren't using it in any of the CAggs

joseph-sentry · 2025-02-12T17:34:10Z

The experiments I did last week using clickhouse made it clear that pre-aggregating on the high-cardinality branch column (lots of feature branches, but very few runs for each) does not make much sense, and it's possible to use the raw data table to query per test, or to aggregate per feature branch.
The pre-aggregation / materialized view makes sense for the main branch and "across all branches" though.

right, i just verified this locally with timescale and i think you're right, it also reflects what you saw with your testing in Alloy for the dedicated tests view.

joseph-sentry · 2025-02-12T18:45:31Z

just one q - do you want to do RiskyRunSQL? I'm not sure if we do that for timeseries things.

i think it should be fine in this case. from what i understand, we usually do risky migrations for operations that will lock a table for a long time, like creating an index on a large table such as uploads or repos. since in this case we're creating a new table and then running operations on it while it's empty, it shouldn't be a problem.

Swatinem · 2025-02-13T08:39:12Z

shared/django_apps/timeseries/migrations/0015_testrun_model.py

+        migrations.AddIndex(
+            model_name="measurement",


this seems unrelated? might be good to move to a different PR/migration.

joseph-sentry requested a review from a team February 10, 2025 21:25

joseph-sentry mentioned this pull request Feb 10, 2025

create utils for accessing testrun timescale models codecov/worker#1078

Draft

joseph-sentry force-pushed the joseph/testruns branch from e31d268 to 8d1d0a2 Compare February 11, 2025 16:45

joseph-sentry force-pushed the joseph/testruns branch 2 times, most recently from 0d177e8 to 2fdefc4 Compare February 11, 2025 21:37

nora-codecov approved these changes Feb 11, 2025

Swatinem approved these changes Feb 12, 2025

joseph-sentry force-pushed the joseph/testruns branch from 2fdefc4 to 11856f5 Compare February 12, 2025 17:35

joseph-sentry force-pushed the joseph/testruns branch 2 times, most recently from 953bfa0 to 4bef0cc Compare February 12, 2025 21:39

Swatinem approved these changes Feb 13, 2025

feat: create testruns model in timeseries app

aadbae1

joseph-sentry force-pushed the joseph/testruns branch from 4bef0cc to aadbae1 Compare February 13, 2025 15:28
