Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 1.05 KB

README.md

File metadata and controls

19 lines (13 loc) · 1.05 KB

DSS_S23_Scripture

Society's Scripture Matching

code

  • run_extractor.py python script that cleans the data and runs the MatchExtractor class comparing the scripture dataset phrases with the woodruff journal entries dataset.

  • MatchExtractor.py class that takes in Woodruff journal entries dataset, splits the entries into a list of phrase_length word phrases, then loops through all entries and scriptures to find matches. includes extensions so that matches over phrase_langth can be found.

  • StringUtil.py string and pandas dataframe utility class to fix some the existing pandas dataframe methods.

data

in progress

  • add internal id, parent id, order columns to matches
  • streamlit app with some analysis