Complex Word Identification (CWI) for Lexical Simplification (LS) in Spanish texts for patients

Corpus and notebook consisting of a total of 225 texts made up of 75 clinical trials (CTs), 75 consent forms (CFs) and 75 patient information documents (PIDs), used in the MA thesis "Lexical Simplification in Spanish Texts for Patients: the Complex Word Identification Task".

Acknowledgements

This code is adapted from https://github.com/huggingface/notebooks/blob/main/examples/token_classification.ipynb

How to cite

If you use this data, please cite as follows:

@article{2024CWI,
  title={Complex Word Identification for Lexical Simplification in Spanish Texts for Patients},
  volume={Under review},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
LS-CWI-ES.ipynb		LS-CWI-ES.ipynb
LS_CWI_ES_MarIA.ipynb		LS_CWI_ES_MarIA.ipynb
README.md		README.md
data.zip		data.zip
scitkitlearn_ner_features.ipynb		scitkitlearn_ner_features.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Complex Word Identification (CWI) for Lexical Simplification (LS) in Spanish texts for patients

Acknowledgements

How to cite

About

Releases

Packages

Contributors 2

Languages

fede-ortega/LS-CWI-ES

Folders and files

Latest commit

History

Repository files navigation

Complex Word Identification (CWI) for Lexical Simplification (LS) in Spanish texts for patients

Acknowledgements

How to cite

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages