Use Cats backpressure to throttle writes (and conversions to HTTP requests) #1000

thorkildcognite · 2024-12-11T16:10:06Z

No description provided.

…uests)

codecov · 2024-12-11T16:23:15Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.33%. Comparing base (28a3425) to head (54431c0).

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #1000   +/-   ##
=======================================
  Coverage   83.33%   83.33%           
=======================================
  Files          47       47           
  Lines        3156     3156           
  Branches      452      455    +3     
=======================================
  Hits         2630     2630           
  Misses        526      526

Files with missing lines	Coverage Δ
...main/scala/cognite/spark/v1/RawTableRelation.scala	`94.87% <100.00%> (ø)`

dmivankov · 2024-12-11T17:57:44Z

src/main/scala/cognite/spark/v1/RawTableRelation.scala

-                .unsafeRunSync()
+          Backpressure[IO](Backpressure.Strategy.Lossless, maxOutstandingRawInsertRequests)
+            .flatMap { backpressure =>
+              rows.grouped(batchSize).toVector.parTraverse_ { batch: Seq[Row] =>


compared to _.grouped.toSeq.grouped.foreach it will now create a vector of all batches, they will reference all Row items from iterator which potentially could be not a lot of mem, less than request bodies, more than constant mem

but compared to .grouped.toVector it is about the same apart from having semaphore

doesn't look like there's a out-of-the box Seq.parTraverse_, or even better the one that would not take more items when semaphore is full, so for now we can try the .toVector.parTraverse_

Use Cats backpressure to throttle writes (and conversions to HTTP req…

e570aa9

…uests)

thorkildcognite requested a review from a team as a code owner December 11, 2024 16:10

thorkildcognite requested a review from paulcognite December 11, 2024 16:10

github-actions bot requested a review from silvavelosa December 11, 2024 16:10

no unsafeRunSync inside: use flatMap+traverse

03c025d

dmivankov temporarily deployed to CI December 11, 2024 17:51 — with GitHub Actions Inactive

parTraverse actually

54431c0

dmivankov had a problem deploying to CI December 11, 2024 17:52 — with GitHub Actions Failure

dmivankov reviewed Dec 11, 2024

View reviewed changes

dmivankov approved these changes Dec 11, 2024

View reviewed changes

dmivankov temporarily deployed to CI December 11, 2024 18:04 — with GitHub Actions Inactive

Merge branch 'master' into pressure-it-is-all-about-the-right-pressure

d56b64e

dmivankov had a problem deploying to CI December 17, 2024 13:38 — with GitHub Actions Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Cats backpressure to throttle writes (and conversions to HTTP requests) #1000

Use Cats backpressure to throttle writes (and conversions to HTTP requests) #1000

thorkildcognite commented Dec 11, 2024

codecov bot commented Dec 11, 2024 •

edited

Loading

dmivankov Dec 11, 2024

dmivankov Dec 11, 2024

Use Cats backpressure to throttle writes (and conversions to HTTP requests) #1000

Are you sure you want to change the base?

Use Cats backpressure to throttle writes (and conversions to HTTP requests) #1000

Conversation

thorkildcognite commented Dec 11, 2024

codecov bot commented Dec 11, 2024 • edited Loading

Codecov Report

dmivankov Dec 11, 2024

Choose a reason for hiding this comment

dmivankov Dec 11, 2024

Choose a reason for hiding this comment

codecov bot commented Dec 11, 2024 •

edited

Loading