Feature/enable aggregation #364

Merged: 9 commits, Nov 16, 2020

vyasr (Contributor) commented Nov 16, 2020

Add the infrastructure for aggregation to flow. All the necessary classes are in place, and the code has been modified where appropriate to handle aggregates instead of jobs. However, no actual aggregation occurs yet: every aggregate has size 1. We plan to refactor and optimize the existing code paths before implementing that feature.

kidrahahjo and others added 9 commits August 13, 2020 11:30
Changes the submission id generation to prepare for aggregation. The `__str__` method of `_JobOperation` is also changed to better handle job aggregation. The unique id generated using a hash of all the `_JobOperation` object's data is still the same, so the uniqueness should not be in question; the only change is to the readable part of the id.

The change to the `str` implementation mimics NumPy, and the submission id favors brevity.
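
To illustrate the idea in this commit message, here is a minimal sketch of an id whose uniqueness comes from a hash of the full data while the readable part stays short. The function names, the `project` prefix, the truncation lengths, and the use of MD5 are all assumptions made for illustration; they are not taken from the flow source.

```python
import hashlib


def readable_jobs(job_ids, max_shown=3):
    """Hypothetical str-like summary: elide the middle of long job lists,
    similar in spirit to how NumPy abbreviates long arrays."""
    job_ids = [str(j) for j in job_ids]
    if len(job_ids) > max_shown:
        shown = job_ids[:1] + ["..."] + job_ids[-1:]
    else:
        shown = job_ids
    return "(" + ", ".join(shown) + ")"


def submission_id(op_name, job_ids, prefix="project"):
    """Hypothetical id: a short readable part followed by a hash of all data."""
    job_ids = [str(j) for j in job_ids]
    # The hash covers every job id, so uniqueness never depends on the
    # abbreviated readable part.
    full = "/".join([prefix, op_name, *job_ids])
    digest = hashlib.md5(full.encode("utf-8")).hexdigest()
    # Brevity: only a truncated form of the first job id is human-readable.
    readable = f"{prefix}/{op_name}/{job_ids[0][:8]}"
    return f"{readable}/{digest}"


id_a = submission_id("compute", ["abc", "def"])
id_b = submission_id("compute", ["abc", "xyz"])
```

In this sketch `id_a` and `id_b` share the same readable part but differ in the hash, which is the property the commit message relies on.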
Add a new decorator by which operations can be tagged for various types of aggregation, along with classes to store actual aggregates once they have been generated.
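
As a rough sketch of what tagging an operation for aggregation could look like, the example below attaches a small specification object to a function without changing its behavior. `AggregateSpec`, `aggregate`, and the `_aggregate_spec` attribute are hypothetical names chosen for this illustration and should not be read as the decorator added in this commit.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class AggregateSpec:
    """Hypothetical description of how an operation's jobs are grouped."""

    group_size: Optional[int] = None  # None means one aggregate of all jobs
    sort_key: Optional[str] = None    # job attribute to sort by before grouping
    sort_ascending: bool = True


def aggregate(spec: AggregateSpec) -> Callable:
    """Tag a function with an aggregation spec and return it unchanged."""

    def decorator(func):
        # Store the spec on the function; grouping happens elsewhere, later.
        func._aggregate_spec = spec
        return func

    return decorator


@aggregate(AggregateSpec(group_size=2, sort_key="temperature"))
def count_jobs(*jobs):
    """An operation that receives a whole aggregate as positional arguments."""
    return len(jobs)
```

Keeping the tag separate from the generated aggregates mirrors the split described above between the decorator and the classes that store aggregates.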
…ing (#351)

* Use sort_ascending instead of reverse_order.

* Fix and simplify hash methods.

* Clarify docstrings.

* Initialize list/dict with literals.

* Use one-line function instead of defining a separate method.

* Don't wrap line.

* class -> object.

* use itertools reference while using groupby

* Use reference for zip_longest too

* Document _get_unique_function_id

Co-authored-by: Hardik Ojha
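
The list above mentions `groupby`, `zip_longest`, and `sort_ascending`. The following sketch shows one common way these standard-library tools can build groups of jobs; the helper names `groups_of` and `group_by` and their signatures are assumptions for illustration, not the code changed in this commit.

```python
from itertools import groupby, zip_longest

_FILL = object()  # sentinel so that None remains a valid item


def groups_of(items, size):
    """Split items into tuples of at most `size` elements."""
    # The same iterator repeated `size` times makes zip_longest consume
    # `size` items per output tuple; the last tuple is padded with _FILL.
    chunks = zip_longest(*[iter(items)] * size, fillvalue=_FILL)
    return [tuple(x for x in chunk if x is not _FILL) for chunk in chunks]


def group_by(items, key, sort_ascending=True):
    """Group items that share the same key value."""
    # groupby only merges adjacent equal keys, so sort by the key first.
    ordered = sorted(items, key=key, reverse=not sort_ascending)
    return [tuple(group) for _, group in groupby(ordered, key=key)]


pairs = groups_of([1, 2, 3, 4, 5], 2)
by_length = group_by(["bb", "a", "cc", "d"], key=len)
```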
Change the internals of flow so that everything operates on default aggregates (aggregates of size 1) rather than individual jobs.
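
A minimal sketch of why default aggregates leave existing behavior unchanged: if every job is wrapped in a tuple of length 1 and operations are called by unpacking that tuple, a single-job operation sees exactly the argument it saw before. The names `default_aggregates`, `run`, and `label` are hypothetical, and jobs are represented as plain strings for simplicity.

```python
def default_aggregates(jobs):
    """Wrap each job in its own aggregate (a tuple of length 1)."""
    return [(job,) for job in jobs]


def run(operation, aggregates):
    """Call the operation once per aggregate, unpacking its jobs."""
    return [operation(*aggregate) for aggregate in aggregates]


def label(job):
    """A pre-existing operation written for a single job."""
    return f"processed {job}"


aggregates = default_aggregates(["job_a", "job_b"])
results = run(label, aggregates)
```

Under these assumptions, the internals only ever handle aggregates, and real grouping can later be introduced by changing how the aggregates are generated.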
@vyasr vyasr requested review from a team as code owners November 16, 2020 15:46
@vyasr vyasr requested review from atravitz and vishav1771 and removed request for a team, atravitz and vishav1771 November 16, 2020 15:46
vyasr (Contributor, Author) commented Nov 16, 2020

No review is necessary here. I'll merge into next as soon as the tests on enable-aggregation pass (they were triggered by merging in #335).

@vyasr vyasr merged commit 699873f into next Nov 16, 2020
@vyasr vyasr deleted the feature/enable-aggregation branch November 16, 2020 16:09

3 participants