-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compute running time for pipelines in Pycharm (pstat) #55
Comments
@Lillliant Thanks for the update @soroush-ziaeinejad Why the codeline is not stable anymore? |
Hi @hosseinfani and @Lillliant I checked the latest code on GitHub and it worked for all combinations (except gsdmm due to a bug which is resolved, please pull) on toy.synthetic dataset. On bigger datasets, you might face MemoryError. One thing that you should consider is parameter adjusting. For instance, setting the number of topics as 50 on toy.synthetic may cause very low weights for user graphs and then all connections will be cut off because weights are under the threshold. Then an error may be raised for trying to create a graph without any connections (it says the summation of weights should be non-negative and non-zero) If you still have problems with running SEERa, please send me the ParamsTemplate.py and a screenshot of the error. Thanks :) |
@Sharjeeliv fyi, see above. |
@Lillliant |
Hi @hosseinfani @soroush-ziaeinejad, sorry for not posting the updates sooner. I think I'll be able to post more frequently now that most of my personal and school matters have resolved. So far:
|
Update
Next Step
|
Update
|
@Lillliant |
Hey @Lillliant Thanks for your PR. I didn't look at the code carefully but I get this error when I run the code with argument -p True. main.py: error: unrecognized arguments: True Should I change something? |
Hi @soroush-ziaeinejad, just the command line flag |
Update
|
Update
|
@hosseinfani @soroush-ziaeinejad
Update
Pycharm has a feature to profile function execution time, call counts and call graphs, so I've been using it to profile the different pipeline combinations. GitHub doesn't seem to support the .pstat files, so I uploaded them to MS Teams for the time being. The files can be viewed using Tools>Open CProfile snapshot in Pycharm. When you hover over the functions, you can also see the location of the code.
Current Issue
As for now, I cannot run any of the pipeline except for lda.gensim combinations after the new updates to the code. I initially thought it was because I didn't install bitermplus, so I tried to resolve the "C++ redistributable version must be 14 and above" issue by downloading the C++ build tool along with the redistributable. I don't think it solved the issue, so I'm looking through the code to see what may be the issues.
Planned Next Step
The text was updated successfully, but these errors were encountered: