
WARNING: Can not find mwt: default from official model list. Ignoring it. #297

Closed
rodriguesfas opened this issue May 8, 2020 · 6 comments


@rodriguesfas

Hello, I'm running a pipeline with Stanza and I get a warning for the MWT step:
WARNING: Can not find mwt: default from official model list. Ignoring it.
Do you know what it means? For some reason this model is not present in stanza_resources.

@yuhaozhang
Member

Can you provide more details on how you initialize the pipeline? A code snippet will help us reproduce the issue.

@twang18

twang18 commented May 13, 2020

I am getting the same warnings for both en and zh models.

The code and logs are as follows:

stanza.download('zh-hans')
nlp = stanza.Pipeline(lang='zh-hans', processors='tokenize,mwt,pos', use_gpu=False)

Downloading https://raw.githubusercontent.com/stanfordnlp/stanza-resources/master/resources_1.0.0.json: 116kB [00:00, 1.25MB/s]
2020-05-13 08:25:16 INFO: Downloading default packages for language: zh-hans (Simplified_Chinese)...
2020-05-13 08:25:18 INFO: File exists: C:\Users\WT.YX\stanza_resources\zh-hans\default.zip.
2020-05-13 08:25:25 INFO: Finished downloading models and saved to C:\Users\WT.YX\stanza_resources.
2020-05-13 08:25:25 WARNING: Can not find mwt: default from official model list. Ignoring it.
2020-05-13 08:25:25 INFO: Loading these models for language: zh-hans (Simplified_Chinese):

2020-05-13 08:25:25 INFO: Use device: cpu
2020-05-13 08:25:25 INFO: Loading: tokenize
2020-05-13 08:25:25 INFO: Loading: pos
2020-05-13 08:25:28 INFO: Done loading processors!

stanza.download('en')
nlp = stanza.Pipeline(lang='en', processors='tokenize,mwt,pos', use_gpu=False)

Downloading https://raw.githubusercontent.com/stanfordnlp/stanza-resources/master/resources_1.0.0.json: 116kB [00:00, 1.28MB/s]
2020-05-13 08:23:02 INFO: Downloading default packages for language: en (English)...
2020-05-13 08:23:02 INFO: File exists: C:\Users\WT.YX\stanza_resources\en\default.zip.
2020-05-13 08:23:07 INFO: Finished downloading models and saved to C:\Users\WT.YX\stanza_resources.
2020-05-13 08:23:07 WARNING: Can not find mwt: default from official model list. Ignoring it.
2020-05-13 08:23:07 INFO: Loading these models for language: en (English):

Are there any tricks I am missing here? Also, according to the tutorials the mwt processor is required before pos, so will the absence of the mwt processor affect the subsequent pos performance? Thanks for any enlightenment!

@yuhaozhang
Member

According to the Universal Dependencies tokenization guidelines, many languages do not have multi-word token (MWT) expansions; English and Chinese are among them. So we do not ship MWT models for English and Chinese, and you do not need the MWT processor to produce accurate UD parsing for these languages.
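To make the distinction concrete, here is a toy illustration of what MWT expansion does in languages that have it. The lookup table below is hand-written for illustration (it is not Stanza's data, and the `expand` helper is not part of the Stanza API):

```python
# Hand-picked UD-style MWT expansions for two languages that have them.
# A multi-word token is one surface token that corresponds to several
# syntactic words, e.g. French "du" = "de" + "le".
MWT_EXPANSIONS = {
    "fr": {"du": ["de", "le"], "aux": ["à", "les"]},
    "de": {"zum": ["zu", "dem"], "im": ["in", "dem"]},
}

def expand(lang, token):
    """Return the word sequence for a token; tokens (or languages)
    with no entry in the table pass through unexpanded."""
    return MWT_EXPANSIONS.get(lang, {}).get(token, [token])

print(expand("fr", "du"))     # ['de', 'le']
print(expand("de", "zum"))    # ['zu', 'dem']
print(expand("en", "apple"))  # ['apple'] -- no expansion table for en here
```

For a language with no expansion table, every token maps to itself, which is why an MWT model adds nothing for such languages.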

@yuhaozhang
Member

To add to the above answer: since this is only a warning message, you can simply ignore it, and it should not affect the actual running of the pipeline at all. But removing mwt from the processors list will get rid of the warning message.
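One way to apply that advice without hand-editing every call site is a small helper that strips `mwt` from the processors string for languages known to lack MWT models. This helper and its language set are hypothetical (not part of Stanza); the set below lists only the two languages discussed in this thread:

```python
# Hypothetical helper, not part of Stanza: drop "mwt" from a processors
# string for languages whose official packages ship no MWT model, so that
# stanza.Pipeline(...) is not asked to load a processor that does not exist.
LANGS_WITHOUT_MWT = {"en", "zh-hans"}  # illustrative subset, not exhaustive

def trim_processors(lang, processors):
    steps = [p.strip() for p in processors.split(",")]
    if lang in LANGS_WITHOUT_MWT:
        steps = [p for p in steps if p != "mwt"]
    return ",".join(steps)

print(trim_processors("zh-hans", "tokenize,mwt,pos"))  # tokenize,pos
print(trim_processors("fr", "tokenize,mwt,pos"))       # tokenize,mwt,pos
```

You would then call, for example, `stanza.Pipeline(lang='zh-hans', processors=trim_processors('zh-hans', 'tokenize,mwt,pos'), use_gpu=False)` and the warning should disappear.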

@argosopentech

argosopentech commented Mar 13, 2024

I'm also seeing this warning after upgrading my Stanza version to 1.8.1:

2024-03-13 11:51:25 WARNING: Language en package default expects mwt, which has been added

I have this for my processors list: processors="tokenize"

@AngledLuffa
Collaborator

This is expected. English does have MWTs, such as won't, gonna, and Jennifer's.

It has caused some amount of irritation, though, such as:

#1366
#1361

We are working through those issues.
