Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BERT - Dataset loader birectional around targets #46

Open
bramiozo opened this issue Dec 2, 2021 · 0 comments
Open

BERT - Dataset loader birectional around targets #46

bramiozo opened this issue Dec 2, 2021 · 0 comments
Assignees

Comments

@bramiozo
Copy link
Collaborator

bramiozo commented Dec 2, 2021

At the moment the TextDatasetFromDataFrame class collects tag/entity sequences from the start of the document until it fills the block. This is not ideal; we would rather have the block of text surrounding the target terms:

  • if there is one term, just center around the term
  • if there are more terms, center around the int(N/2) term
@bramiozo bramiozo self-assigned this Dec 2, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant