AIP-84 Convert async route to sync routes #43797

pierrejeambrun · 2024-11-07T17:40:59Z

As discussed in #43718 (comment), routes with blocking I/O code should be sync to not block main event loop. (db access, disk read, network call, etc...)

More information in FastAPI documentation https://fastapi.tiangolo.com/async/#path-operation-functions.

This PR converts all of the endpoints, all that can me converted to async def when we implement full async support.

(I recall Maybe one or two endpoints that are purely in memory and could stay async but I didn't bother to make an exception for them)

bbovenzi · 2024-11-07T18:18:50Z

I see three endpoints still using async. Do we want to change those too?

pierrejeambrun · 2024-11-07T20:18:44Z

Good catch,, I forgot the private API.

Updated thanks Brent.

omkar-foss · 2024-11-08T06:25:26Z

Hey @pierrejeambrun, I've one concern on this - if we use sync path funcs, the FastAPI requests will run in a threadpool (one request per thread), consuming more memory per request and will limit throughput on our new APIs, as there's a default limit of 40 threads on the threadpool, please see: encode/starlette#1724 (context: AnyIO is now used for async IO by Starlette, which in turn is used by FastAPI to handle http requests).

So, just a suggestion - instead of changing all the path funcs from async to sync, it'll be great if wrap the blocking (sync) function calls inside the path funcs to make them async, using asyncio.to_thread or asyncify or similar.

dolfinus · 2024-11-08T07:55:09Z

Hm, I don't get why this code:

@route.get(...)
def handler(...):
  something = session.get(...)
  other = session.select(...)

await asyncio.to_thread(handler)

does limit thoughput, but this doesn't:

@route.get(...)
async def handler(...):
  something = await asyncio.to_thread(session.get, ...)
  other = await asyncio.to_thread(session.select, ...)

Could you please elaborate?

pierrejeambrun · 2024-11-08T10:34:46Z

I agree with @dolfinus, running in a separate thread manually or leveraging FastAPI to do so is more or less the same. (just less work and more code maintainability to let FastAPI handle that).

Long term we will rewrite that with full async support, in the meantime FastAPI is just sync for us.

omkar-foss · 2024-11-08T10:36:43Z

Hey @dolfinus, yes, the throughput in this case would be very similar for both snippets, because when calling asyncio.to_thread() (executor unspecified), the default thread pool executor will be used and be subject to same 40 thread limit. But unlike sync functions where entire request is processed in a thread, in async we'll have control over what should be processed in a separate thread.

Example: As our new APIs have a mix of CPU-bound (e.g. common params resolution, data checks, Pydantic validations, etc.) and IO-bound (e.g. DB queries, network calls) activities, I guess we could tune it something like this:

@route.get(...)
async def handler(...):
  # CPU bound
  if some_check:
      raise HTTPException(status.HTTP_404_NOT_FOUND, "Not Found")
  
  # IO bound, sent to it's own thread
  something = await asyncio.to_thread(session.get, ...)
  
  # CPU bound again
  return SomePydanticModel(something)

We could alternatively use run_in_threadpool instead of asyncio.to_thread which is provided by FastAPI, example here. Either way we go, it'll need thorough testing to understand what's working for us in terms of performance! :)

omkar-foss · 2024-11-08T10:39:56Z

Long term we will rewrite that with full async support, in the meantime FastAPI is just sync for us.

That would be lovely, thank you! :)

pierrejeambrun · 2024-11-08T11:38:38Z

I think that would be a lot of work to maintain + code becomes hard to read + 1 mistake (someone forget to manually put into the threadpool a blocking IO call) and then the main event loop is blocked...

I think we can start like that, and if it's not enough we can go deeper into the fine tuning of what is executed in the main even loop and what is run in a separate thread. I believe CPU bound operations run in a separate thread won't bottleneck. (And if they do, most likely the main even loop would struggle too, so we would have another problem here)

omkar-foss · 2024-11-08T12:59:57Z

Yes sure, sounds good! Thanks @pierrejeambrun 👍🏽

* AIP-84 convert async route to sync routes * Update following code review * Fix CI

AIP-84 convert async route to sync routes

7fe62ef

pierrejeambrun self-assigned this Nov 7, 2024

pierrejeambrun requested a review from ephraimbuddy as a code owner November 7, 2024 17:41

pierrejeambrun requested review from bbovenzi, kaxil and ashb November 7, 2024 17:41

pierrejeambrun mentioned this pull request Nov 7, 2024

Migrate public endpoint Get Task to FastAPI #43718

Merged

kaxil approved these changes Nov 7, 2024

View reviewed changes

Update following code review

7688c14

bbovenzi approved these changes Nov 7, 2024

View reviewed changes

jscheffl approved these changes Nov 7, 2024

View reviewed changes

rawwar mentioned this pull request Nov 8, 2024

AIP-84 Add ability to update dag run note in PATCH dag_run endpoint #43508

Merged

shahar1 approved these changes Nov 8, 2024

View reviewed changes

Fix CI

3cf1a50

pierrejeambrun merged commit 36e716a into apache:main Nov 8, 2024
52 checks passed

pierrejeambrun deleted the aip-84-transform-async-to-sync-route branch November 8, 2024 11:39

ellisms pushed a commit to ellisms/airflow that referenced this pull request Nov 13, 2024

AIP-84 Convert async route to sync routes (apache#43797)

1ff50dd

* AIP-84 convert async route to sync routes * Update following code review * Fix CI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AIP-84 Convert async route to sync routes #43797

AIP-84 Convert async route to sync routes #43797

pierrejeambrun commented Nov 7, 2024 •

edited

Loading

bbovenzi commented Nov 7, 2024 •

edited

Loading

pierrejeambrun commented Nov 7, 2024

omkar-foss commented Nov 8, 2024

dolfinus commented Nov 8, 2024 •

edited

Loading

pierrejeambrun commented Nov 8, 2024

omkar-foss commented Nov 8, 2024

omkar-foss commented Nov 8, 2024

pierrejeambrun commented Nov 8, 2024 •

edited

Loading

omkar-foss commented Nov 8, 2024

AIP-84 Convert async route to sync routes #43797

AIP-84 Convert async route to sync routes #43797

Conversation

pierrejeambrun commented Nov 7, 2024 • edited Loading

bbovenzi commented Nov 7, 2024 • edited Loading

pierrejeambrun commented Nov 7, 2024

omkar-foss commented Nov 8, 2024

dolfinus commented Nov 8, 2024 • edited Loading

pierrejeambrun commented Nov 8, 2024

omkar-foss commented Nov 8, 2024

omkar-foss commented Nov 8, 2024

pierrejeambrun commented Nov 8, 2024 • edited Loading

omkar-foss commented Nov 8, 2024

pierrejeambrun commented Nov 7, 2024 •

edited

Loading

bbovenzi commented Nov 7, 2024 •

edited

Loading

dolfinus commented Nov 8, 2024 •

edited

Loading

pierrejeambrun commented Nov 8, 2024 •

edited

Loading