Field note_nlp."offset" should be integer, not varchar #723

Open
ekorchmar opened this issue Jan 30, 2025 · 0 comments

From the documentation:

"Character offset of the extracted term in the input note"

I assume that this field is expected to contain a numeric value: the index of the first character of the extracted token (note_nlp.lexical_variant) within the full note text (note.note_text).

For instance, if the note text is "Diagnosed with eczema", the value of note_nlp.lexical_variant would be eczema and the value of note_nlp."offset" would be 15 (counting from zero).
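To make the arithmetic concrete, here is a minimal Python sketch of that example. It assumes the offset is zero-based, which is what the value 15 implies; the documentation quoted above does not state the convention, so treat that as my reading rather than the specification.

```python
# Assumption: offsets are zero-based character indexes into note.note_text.
note_text = "Diagnosed with eczema"   # stands in for note.note_text
lexical_variant = "eczema"            # stands in for note_nlp.lexical_variant

# str.find returns the index of the first match, or -1 if there is none.
offset = note_text.find(lexical_variant)

# Slicing the note at the offset should give back the extracted token.
recovered = note_text[offset:offset + len(lexical_variant)]
```

With an integer column, the slice works directly on the stored value; with varchar, every consumer has to cast first.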

My assumption is that the purpose of this field is to specify which occurrence of the same literal token is being referred to. If so, it should be a plain integer, with no extra information.
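A rough illustration of the disambiguation use case, again assuming zero-based offsets. The helper names `find_offsets` and `matches_at` and the sample note are hypothetical and exist only for this sketch; they are not part of any OMOP tooling.

```python
import re

def find_offsets(note_text: str, term: str) -> list[int]:
    """Return the start index of every literal occurrence of term."""
    return [m.start() for m in re.finditer(re.escape(term), note_text)]

def matches_at(note_text: str, term: str, offset: int) -> bool:
    """Check that term really begins at the given offset in the note."""
    return note_text.startswith(term, offset)

# Hypothetical note in which the same token occurs twice.
note_text = "eczema on left arm, eczema on right arm"
offsets = find_offsets(note_text, "eczema")

# With the current varchar typing, a consumer has to cast before using the value.
stored_offset = "20"               # what a varchar column would hand back
second_mention = int(stored_offset)
is_valid = matches_at(note_text, "eczema", second_mention)
```

The offset alone tells the two mentions apart, and the cast on the last lines is the extra step an integer column would remove.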
