What does "do_sample" do?
Hi all, I would like to know what "do_sample" does in the generation settings, and why memory consumption increases after turning it off. With do_sample disabled, it is almost impossible to generate on 12 GB of video memory with llama-13b-4bit. I get an OOM error:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0; 12.00 GiB total capacity; 11.15 GiB already allocated; 0 bytes free; 11.24 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
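For context, my understanding is that do_sample is the sampling flag passed to Hugging Face Transformers' generate(). A minimal sketch of the kind of call I mean (the model path and parameter values below are illustrative, not my exact setup):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative model path, not my actual 4-bit setup
model_path = "decapoda-research/llama-13b-hf"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path)

inputs = tokenizer("Hello, my name is", return_tensors="pt")

# do_sample=True: sample the next token from the probability
#   distribution (after temperature/top_p filtering).
# do_sample=False: greedy decoding, i.e. always pick the single
#   most likely next token.
out = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.7,   # illustrative values
    top_p=0.9,
    max_new_tokens=50,
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```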
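The error message itself suggests setting max_split_size_mb via PYTORCH_CUDA_ALLOC_CONF to reduce fragmentation. In case it helps anyone reproduce or work around this, here is one way to set it from Python; the value 128 is illustrative, and the variable has to be set before the CUDA caching allocator initializes:

```python
import os

# Must be set before the first CUDA allocation; 128 MB is an
# illustrative split size, not a recommended value.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

import torch  # imported after the env var is set
```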