Help using aggregate_raw_results #56

martyzz1 · 2024-02-13T17:27:24Z

I've been going around in circles a bit trying to understand if this library can be used to speed up decoding an aggregation query.

Or whether after recent pymongo updates its needed at all.

documents = []

cursor = collection.aggregate_raw_batches(
                              pipeline=aggregation_query,
)
while True:
    try:
        documents.extend([x for x in decode_all(cursor.next())])
    except StopIteration:
        break

How would I use bsonjs.dumps instead?

The text was updated successfully, but these errors were encountered:

ShaneHarvey · 2024-02-13T18:57:51Z

This library is only useful for converting raw BSON data (eg RawBSONDocument) to MongoDB Extended JSON. If you need the documents to be decoded into Python dict then this library will not help.

Also aggregate_raw_batches is only useful when the app needs a stream of raw BSON data. If you're going to decode_all then it will be more efficient to use a regular aggregate:

documents = list(collection.aggregate(pipeline))

ShaneHarvey · 2024-02-13T19:07:02Z

For help speeding up your application I suggest posting here: https://www.mongodb.com/community/forums/tag/python

It would help to include more info about the size of the result set (how big is documents?), how long is the query vs the query decoding, what happens to documents, would it be faster to process the documents individually?, etc.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Help using aggregate_raw_results #56

Help using aggregate_raw_results #56

martyzz1 commented Feb 13, 2024 •

edited

Loading

ShaneHarvey commented Feb 13, 2024

ShaneHarvey commented Feb 13, 2024

Help using aggregate_raw_results #56

Help using aggregate_raw_results #56

Comments

martyzz1 commented Feb 13, 2024 • edited Loading

ShaneHarvey commented Feb 13, 2024

ShaneHarvey commented Feb 13, 2024

martyzz1 commented Feb 13, 2024 •

edited

Loading