-
-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimizations for scikit-tree to improve multi-core performance #245
Comments
I think in terms of sequential experiments to run:
If this doesn't look good, it means forsure our compiler is messed up somehow, or we introduce some serious issues in the fork that we're not aware of.
Within each of the above, we would have to investigate CPU/RAM usage in-depth using valgrind, or something... |
To confirm this is not an isolated issue with comight right? Or so far it is? |
it is not |
We ran some tests and after the fix Adam pushed the diff between RF and sktree-RF are:
Note: the result for sktree-RF was 7sec+ prior to this fix. The script for this test is found : https://github.com/neurodata/might/blob/cmi/exps/new_submission/Figure6_comight_vs_nsamples_ndims/test_rf_parallel.py The commit that we tested to get ~3sec on sktree-RF was: 7c75677 |
wooot!! |
Checklist
main
branch.pip freeze
.Description
There is occasional low CPU usage when using scikit-tree forests in parallel. Running the same code, in machines with many cores, I'm getting roughly 4-5% usage with scikit-tree forests and 60-70% using scikit-learn for the same types of problems. We should look into their Cython code optimizations and see how we can make improvements to our code base.
The text was updated successfully, but these errors were encountered: