This directory contains data I created earlier in this repository while preparing for my final project, as well as new data created while working on this project.
documents.json
: The datasets in json formatdocuments-with-id1.json
todocuments-with-id5.json
Old data that will not be used later.documents-with-id.json
: dataset with idllm_answer_cosine.csv
: Data created to evaluate LLM answers.question_vector_pickle
,question_context_answer_vector_pickle
,answer_vector_pickle
,context_vector_pickle
: Directories containing pickle files that store vector embeddings of questions, questions + context + answers, answers, and context from the documents.llm_answer
: This directory contains answers generated by the LLM model.evaluations_aqa
,evaluations_qa
: llm generated evaluation dataground_truth_data
: Ground truth data created earlier.