Implement parser.py #22

alexhebing · 2019-05-24T10:15:52Z

icab_parser.py is currently hard-coded against the ICAB corpus. Make it a little bit more generic by allowing the user to define from which elements to extract the inner text.

Perhaps also allow the user to define which attribute(s) from which element(s) contains the text needed to NER. Experiment a little with this last option, but not too long. As long as the exact corpus that the scripts in this repo will be used against (hence with which it will have to work) is not decided, the options and possibilities are probably too many to come up with a properly generic solution.

The text was updated successfully, but these errors were encountered:

alexhebing · 2019-05-27T10:09:29Z

In our meeting today, clients told me that the corpora that they are most interested in working with are already in txt files. Make it so that this script works for extracting from basic elements (i.e. pass element name to extract text from) and for the Europeana corpus (i.e. extracting the text from the attributes of certain elements).

alexhebing added the enhancement New feature or request label May 24, 2019

alexhebing added this to the Command line v2 / evaluation milestone May 27, 2019

alexhebing changed the title ~~Make icab_parser.py slightly more generic~~ Implement parser.py May 27, 2019

alexhebing self-assigned this Jun 3, 2019

alexhebing pushed a commit that referenced this issue Jun 3, 2019

#22. First working version with tests

69a8147

alexhebing pushed a commit that referenced this issue Jun 3, 2019

#22. README added.

51f9075

alexhebing pushed a commit that referenced this issue Jun 3, 2019

#22. Some further tests added

31b67e2

alexhebing mentioned this issue Jun 4, 2019

evaluate multiNER performance #12

Open

alexhebing pushed a commit that referenced this issue Jun 14, 2019

Refer #22. Use BeautifulSoup instead of ElementTree

5cfcc96

JosedeKruif unassigned alexhebing Dec 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement parser.py #22

Implement parser.py #22

alexhebing commented May 24, 2019

alexhebing commented May 27, 2019

Implement parser.py #22

Implement parser.py #22

Comments

alexhebing commented May 24, 2019

alexhebing commented May 27, 2019