Missing file in RIMES recipe #29

Open
wellescastro opened this issue Mar 30, 2018 · 2 comments

@wellescastro

Hi! I'm trying to execute the RIMES recipe but there's a missing file when I get to the decoding step:
./utils/prepare_word_lexicon_from_boundaries.sh

This script is called inside decode_lm.sh:

# Build lexicon from the boundaries file.
lexiconp=data/lang/forms/word/lexiconp.txt;
[ "$overwrite" = false -a -s "$lexiconp" ] ||
./utils/prepare_word_lexicon_from_boundaries.sh \
  data/lang/forms/word/tr_boundaries.txt > "$lexiconp" ||
{ echo "ERROR: Creating file \"$lexiconp\"!" >&2 && exit 1; }

Can I replace it with ./utils/build_word_lexicon.sh without any side effects?

Thank you!

@jpuigcerver
Owner

jpuigcerver commented Mar 30, 2018 via email

@wellescastro
Author

Oh, that was fast, haha, thank you! Okay, it's running properly now, although I'm getting the following warning in the next step (build_word_lm.sh):

1374 tokens ignored due to missing symbols!

As a result, the vocab file was generated with just under 4000 tokens. Is this normal? Thank you for the support.

syms.txt: https://gist.github.com/wellescastro/daed30071246636dd6a1ff0335184b0a#file-syms-txt
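
For anyone hitting the same warning, one way to see which tokens are involved is to compare the text against the symbols table directly. The sketch below is not part of the recipe: the function name missing_symbols is hypothetical, and it assumes syms.txt holds one "symbol id" pair per line and that the text file contains whitespace-separated tokens. Whether that matches what build_word_lm.sh actually checks is an assumption.

```shell
#!/usr/bin/env bash
# Sketch: list tokens that occur in a text file but have no entry in a
# symbols table, with occurrence counts.
# Assumptions (not confirmed by the recipe): the symbols file has one
# "<symbol> <id>" pair per line; the text file is whitespace-tokenized.
missing_symbols() {
  local syms="$1" text="$2"
  awk '
    NR == FNR { known[$1] = 1; next }         # first file: remember every symbol
    { for (i = 1; i <= NF; ++i)               # second file: check every token
        if (!($i in known)) ++miss[$i] }
    END { for (t in miss) print miss[t], t }  # print "count token" per unknown token
  ' "$syms" "$text" | sort -k1,1nr -k2,2      # most frequent unknown tokens first
}
```

Calling it as missing_symbols syms.txt path/to/tokenized_text.txt (the second path is a placeholder) prints the unknown tokens by frequency, which should show whether the ignored tokens come from a few missing characters or from many rare ones.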
