
Boostings implementations eval_set fix #1358

Merged 12 commits into master on Jan 23, 2025

Conversation

dmitryglhf (Collaborator)

This is a 🔨 code refactoring.

Summary

  • Added new conditions for training boostings with eval_set.
  • Updated multi-output support for boostings.

Resolves the failure in test_preprocessing_through_api/test_categorical_target_processed_correctly: when LightGBM, XGBoost, or CatBoost was used with eval_set, an error occurred because the dataset was too small.
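
As a hedged sketch of the idea behind the new conditions (the helper name and signature are illustrative, not the exact PR code): fit with eval_set only when the validation split is usable, and fall back to a plain fit on tiny datasets.

```python
import numpy as np

def fit_with_optional_eval_set(model, train_input, eval_input,
                               use_eval_set: bool, is_classification: bool):
    """Illustrative helper (hypothetical name): attach eval_set only when it is safe.

    `train_input` / `eval_input` are assumed to expose `.features` and `.target`,
    as FEDOT's InputData does.
    """
    can_use_eval_set = use_eval_set and eval_input is not None
    if can_use_eval_set and is_classification:
        # a small split can distribute classes unevenly between the parts,
        # so require that validation classes are a subset of training classes
        train_classes = np.unique(np.asarray(train_input.target))
        eval_classes = np.unique(np.asarray(eval_input.target))
        can_use_eval_set = bool(np.isin(eval_classes, train_classes).all())
    if can_use_eval_set:
        model.fit(train_input.features, train_input.target,
                  eval_set=[(eval_input.features, eval_input.target)])
    else:
        # fall back to a plain fit on tiny datasets
        model.fit(train_input.features, train_input.target)
    return model
```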

github-actions bot (Contributor) commented Jan 15, 2025

All PEP8 errors have been fixed, thanks ❤️

Comment last updated at Wed, 22 Jan 2025 21:10:21

@dmitryglhf (Collaborator, Author)

/fix-pep8


codecov bot commented Jan 15, 2025

Codecov Report

Attention: Patch coverage is 58.13953% with 18 lines in your changes missing coverage. Please review.

Project coverage is 80.13%. Comparing base (84f4ebb) to head (e0168a7).
Report is 2 commits behind head on master.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...mplementations/models/boostings_implementations.py | 58.53% | 17 Missing ⚠️ |
| fedot/core/operations/evaluation/boostings.py | 50.00% | 1 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1358      +/-   ##
==========================================
- Coverage   80.16%   80.13%   -0.03%     
==========================================
  Files         146      146              
  Lines       10492    10512      +20     
==========================================
+ Hits         8411     8424      +13     
- Misses       2081     2088       +7     

☔ View full report in Codecov by Sentry.

@pep8speaks commented Jan 16, 2025

Hello @dmitryglhf! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2025-01-22 18:09:27 UTC

@dmitryglhf dmitryglhf requested a review from nicl-nno January 17, 2025 17:03
Comment on lines 44 to 45
is_classification_task = input_data.task.task_type == TaskTypesEnum.classification
is_regression_task = input_data.task.task_type == TaskTypesEnum.regression
Collaborator

Can task_type == ts_forecasting reach this point? In any case, if having both flags false is invalid here, that case should be handled.

dmitryglhf (Collaborator, Author)

Fixed: a check has been added one level up, in the boostings.py module.
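
A minimal sketch of what such an upstream guard might look like (the function name and error message are hypothetical; the actual check lives in fedot/core/operations/evaluation/boostings.py):

```python
from fedot.core.repository.tasks import TaskTypesEnum

def check_boosting_task_type(input_data):
    """Reject task types the boosting implementations do not support."""
    supported = (TaskTypesEnum.classification, TaskTypesEnum.regression)
    if input_data.task.task_type not in supported:
        raise ValueError(
            f'Boosting operations do not support task type {input_data.task.task_type}')
```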

del params["early_stopping_rounds"]

# Create model with updated params
if input_data.task.task_type == TaskTypesEnum.classification:
Collaborator

The same check already appears above; the variable can be reused.
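
Illustrative only (the attribute names are hypothetical): compute the task-type flag once and reuse it both for the eval_set conditions and for choosing the model class.

```python
from fedot.core.repository.tasks import TaskTypesEnum

def build_boosting_model(operation, input_data, params: dict):
    """Reuse a single flag instead of re-checking input_data.task.task_type twice."""
    is_classification_task = input_data.task.task_type == TaskTypesEnum.classification
    model_class = (operation.classification_class if is_classification_task
                   else operation.regression_class)
    return model_class(**params)
```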

Comment on lines 71 to 73
params = deepcopy(self.model_params)
if params.get('early_stopping_rounds'):
del params["early_stopping_rounds"]
Collaborator

Perhaps it would be better to change the parameter this way, via self.params.update?

https://github.com/aimclub/FEDOT/blob/master/fedot/core/operations/evaluation/operation_implementations/data_operations/ts_transformations.py#L131

That records the fact that it was changed inside the model.

dmitryglhf (Collaborator, Author)

Fixed: changed it to self.params.update.
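
A hedged sketch of the difference (assuming self.params exposes get and update, as in the ts_transformations.py example linked above; class and method names here are illustrative):

```python
class BoostingImplementationSketch:
    """Toy stand-in for the real implementation class."""

    def __init__(self, params):
        self.params = params  # an OperationParameters-like object with get()/update()

    def _disable_early_stopping(self):
        # Previously: deepcopy the params and del the key on the local copy,
        # which hides the change from the operation's stored parameters.
        # Suggested: update the operation's own parameters so the change is
        # recorded inside the model's parameter set.
        if self.params.get('early_stopping_rounds'):
            self.params.update(early_stopping_rounds=None)
```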

Comment on lines 179 to 187
train_input, eval_input = train_test_data_setup(input_data)

# Conditions for training with eval_set
is_classification_task = input_data.task.task_type == TaskTypesEnum.classification
is_regression_task = input_data.task.task_type == TaskTypesEnum.regression
all_classes_present_in_eval = (
np.unique(np.array(train_input.target)) in np.unique(np.array(eval_input.target))
)
is_using_eval_set = bool(self.params.get('use_eval_set'))
Collaborator

If this code is identical to the block above, it can be extracted into a function.

dmitryglhf (Collaborator, Author)

Fixed: extracted it into a separate function.
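
A hypothetical sketch of such an extracted helper (illustrative name and signature, not the exact function added in the PR):

```python
import numpy as np
from fedot.core.repository.tasks import TaskTypesEnum

def eval_set_is_usable(train_input, eval_input, params) -> bool:
    """Shared check for both code paths: is training with eval_set safe here?"""
    if not bool(params.get('use_eval_set')):
        return False
    task_type = train_input.task.task_type
    if task_type == TaskTypesEnum.classification:
        # every class seen in the validation part must also be present in train
        train_classes = np.unique(np.asarray(train_input.target))
        eval_classes = np.unique(np.asarray(eval_input.target))
        return bool(np.isin(eval_classes, train_classes).all())
    return task_type == TaskTypesEnum.regression
```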

@dmitryglhf dmitryglhf requested a review from nicl-nno January 23, 2025 09:48
nicl-nno (Collaborator) left a comment

Just don't forget to mark in the comments which remarks have been addressed.

@dmitryglhf dmitryglhf merged commit 6dd93b2 into master Jan 23, 2025
10 checks passed
@dmitryglhf dmitryglhf deleted the boostings-implementations-eval-set-fix branch January 23, 2025 10:36