divide by 0 error #48

bendichter · 2024-08-08T20:35:19Z

Running the testing files on an M1 Mac, at step 3 (Model Creation), I get this traceback in the log:

[LOGGING STARTED AT: 2024-08-08 16-33-41]2024-08-08 16:33:41.664 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 378 : Train Variational Autoencoder - model name: VAME
2024-08-08 16:33:41.665 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 392 : warning, a GPU was not found... proceeding with CPU (slow!)
2024-08-08 16:33:41.665 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 437 : Latent Dimensions: 30, Time window: 30, Batch Size: 256, Beta: 1, lr: 0.0005
2024-08-08 16:33:41.680 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 487 : Scheduler step size: 100, Scheduler gamma: 0.20
2024-08-08 16:33:41.680 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 493 : Start training...
2024-08-08 16:33:41.681 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 87 : Training Model: 0%| | 0/499 [00:00<?, ?epoch/s]
2024-08-08 16:33:42.637 INFO --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 87 : Training Model: 0%| | 0/499 [00:00<?, ?epoch/s]
2024-08-08 16:33:42.637 ERROR --- [Thread-150 (process_request_thread)] vame.model.rnn_vae : 566 : An error occurred: float division by zero
Traceback (most recent call last):
File "vame/model/rnn_vae.py", line 502, in train_model
File "vame/model/rnn_vae.py", line 351, in test
ZeroDivisionError: float division by zero

bendichter · 2024-08-09T23:03:26Z

@vinicvaz, any idea why this error might be occurring?

vinicvaz · 2024-08-10T02:58:25Z

Hey @bendichter
What is the size of the data you are using?
If your data is small, the error may be caused by a batch_size that is too large for it.
Can you try reducing the batch_size to see if it works?

bendichter · 2024-08-10T03:53:45Z

It was the test data with default params

bendichter · 2024-08-11T16:09:41Z

@luiztauffer can you please look into this

vinicvaz · 2024-08-13T12:32:21Z

@bendichter by "the testing files", do you mean the raw files or the cropped ones in the tests folder of the vame repository? Can you share the link to the files you are using so I can reproduce it here?
Thanks

bendichter · 2024-08-13T13:25:51Z

https://github.com/catalystneuro/vame-desktop/tree/main/testing

vinicvaz · 2024-08-13T13:51:54Z

Got it. This is the cropped data, and it's quite small, so a batch_size=256 is too big. Could you please test with smaller values and let me know if it still breaks?

luiztauffer · 2024-08-16T09:24:45Z

I can reproduce the error. It is indeed caused by using a large batch size with a small dataset.
Since this is a vame-py related error, not desktop app, I moved it here: EthoML/VAME#75
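
For anyone landing here later, a minimal sketch of how a large batch size on a small dataset can end in a division by zero. This is not the actual code at `vame/model/rnn_vae.py` line 351, which is not shown in this thread. It assumes the evaluation loop drops incomplete batches and then averages the loss over the number of batches. The function names and the sample count of 100 are hypothetical.

```python
def count_full_batches(n_samples: int, batch_size: int) -> int:
    """Number of batches when incomplete batches are dropped (assumed behavior)."""
    return n_samples // batch_size


def mean_loss(batch_losses: list[float]) -> float:
    """Average the per-batch losses; fails when no batch was produced."""
    return sum(batch_losses) / float(len(batch_losses))


def safe_batch_size(n_samples: int, requested: int) -> int:
    """Hypothetical guard: never request more samples per batch than exist."""
    if n_samples < 1:
        raise ValueError("dataset is empty")
    return min(requested, n_samples)


if __name__ == "__main__":
    n_samples = 100  # hypothetical size of a small test split
    print(count_full_batches(n_samples, 256))  # no full batch fits
    print(count_full_batches(n_samples, 10))   # a small batch size does fit
    try:
        mean_loss([])  # zero batches, so there is nothing to average
    except ZeroDivisionError as err:
        print("ZeroDivisionError:", err)
    print(safe_batch_size(n_samples, 256))
```

Under these assumptions, either lowering `batch_size` in the config or clamping it to the dataset size avoids the empty average.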

bendichter · 2024-08-16T12:26:56Z

@luiztauffer could you fix this by adjusting the batch size?

luiztauffer · 2024-08-16T12:27:49Z

Yes, it needs to be small. Try it with 10, for example.

luiztauffer · 2024-08-16T12:29:40Z

@bendichter see this: https://github.com/EthoML/VAME/blob/5247b82946f15e2cb5224e99cef7980506962875/tests/conftest.py#L27-L30

bendichter · 2024-08-16T12:31:59Z

Could you create a test config with proper settings? I understand that mechanically this is not an issue with the desktop app, but practically it is, because we aren't sufficiently communicating to a naive user how to run the app all the way through. Creating a config file that works for the test data would go a long way.

luiztauffer · 2024-08-16T12:40:36Z

@bendichter maybe it's better to point to this dataset instead? That's what people should use when testing for themselves: https://ethoml.github.io/VAME/docs/getting_started/running/#1-download-the-necessary-resources
The data you're using is the one we use only for the GitHub Actions.

bendichter · 2024-08-16T13:11:19Z

OK, so far so good. The training step says it could take 6.5 hours so I won't be able to test the whole thing for a while but it is running now. Let's add this to the README

bendichter assigned vinicvaz Aug 9, 2024

luiztauffer closed this as completed Aug 16, 2024

luiztauffer mentioned this issue Aug 16, 2024

float division by zero for small datasets EthoML/VAME#75

Open

bendichter reopened this Aug 16, 2024

bendichter mentioned this issue Aug 16, 2024

add location of example data #53

Merged

bendichter closed this as completed in #53 Aug 16, 2024
