Voice chat support #30

Open
psugihara opened this issue Nov 2, 2023 · 2 comments

Comments

@psugihara
Owner

There's a fantastic demo here: https://github.com/ggerganov/whisper.cpp/tree/master/examples/talk-llama. I think a good UI would be a button in the text input, like in iOS Messages. Hitting it would put you into voice mode. Apple's built-in TTS is probably fine for now for the response (though I wonder if it has streaming support).

[Screenshot, 2023-11-02 at 8:33:14 AM]

Step 1 would be rigging up a little prototype/demo like Georgi's linked above, but running in FreeChat.
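
On the streaming question: one possible workaround, if the TTS engine only accepts whole strings, is to buffer the model's streamed output and hand it over one sentence at a time. The sketch below is only an illustration of that idea. `SentenceChunker` is a hypothetical helper, not part of FreeChat or whisper.cpp, and its splitting rule is deliberately naive (it also breaks on decimal points and abbreviations).

```swift
import Foundation

/// Hypothetical helper: buffers streamed text fragments and releases
/// complete sentences so each one can be spoken as soon as it is ready.
struct SentenceChunker {
    private var buffer = ""
    // Naive assumption: any of these characters ends a sentence.
    private let terminators: Set<Character> = [".", "!", "?"]

    /// Appends a fragment and returns any sentences it completes.
    mutating func push(_ fragment: String) -> [String] {
        buffer += fragment
        var sentences: [String] = []
        var current = ""
        for ch in buffer {
            current.append(ch)
            if terminators.contains(ch) {
                let trimmed = current.trimmingCharacters(in: .whitespacesAndNewlines)
                if !trimmed.isEmpty { sentences.append(trimmed) }
                current = ""
            }
        }
        // Keep the unfinished tail for the next fragment.
        buffer = current
        return sentences
    }

    /// Returns whatever text is left once the stream ends.
    mutating func flush() -> String? {
        let rest = buffer.trimmingCharacters(in: .whitespacesAndNewlines)
        buffer = ""
        return rest.isEmpty ? nil : rest
    }
}
```

On macOS, each returned sentence could then be wrapped in an `AVSpeechUtterance` and passed to `AVSpeechSynthesizer.speak(_:)`, which queues utterances in order, so speech could start before the full response has been generated.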

@psugihara
Owner Author

Could be useful: https://github.com/yl4579/StyleTTS2

@johnlunney

"Apple's built in TTS is probably fine for now"

On Apple Silicon, could maybe use Apple's on-device speech recognition to keep the file size down... https://developer.apple.com/documentation/speech
