-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Idea for tool: resegment into well-formed sentences #87
Comments
This looks wonderful and would make a great addition to the repository as part of |
What would be your preferred way of making this an optional dependency? Just letting the user run into an import error? (yes it's heavy :-)) |
I guess wrap the ImportError and provide some nice message, but yes. There's also the question of ongoing maintenance -- are you happy to help keep it up to date with new Python versions, for example? I suppose this should probably go in |
Oh, and can I also take a look at the current code before committing to anything :-) |
Yes, of course. Any feedback is welcome, and of course you are not obliged to merge this code. Here is a Colab that shows basic usage: https://colab.research.google.com/drive/1OHBylPv-8s__IU9_lwTW5CLwHQvfB9Rt?usp=sharing |
Hi, this is a great library!
I added one more tool in my fork that does automatic sentence segmentation: bricksdont#1
It changes the distribution of subtitle segments so that each subtitle is exactly one well-formed (and complete) sentence. It's not perfect, a machine learning model is involved.
Here is an example:
Would you be interested in a PR for this?
The text was updated successfully, but these errors were encountered: