Knowledge graph Retrieval-Augmented Framework for Text generation (KRAFT)

This repository contains the dataset and source code for KRAFT.

We use LangChain's Python module for:

Prompting OpenAI's GPT-3.5-Turbo model
Embedding text using OpenAI's Ada-v2 text embedding model

Example

For the prompt "Who is Christopher Nolan's brother's wife?", with iterative KRAFT, we get the following:

Setup

Run pip install -r requirements.txt

Dataset

We use the following two datasets to evaluate the performance of our framework:

StrategyQA, which consists of yes/no questions that require multistep reasoning
2WikiMultiHop, which consists of short-answer questions that require multistep reasoning

Both datasets are already included in the repository.

Preprocessing

Run python src/make_mapping.py and python src/push_vectors.py

KRAFT Experiments and Evaluation

In experiments.py, we iterate over the values of k, the two search strategies (simple and iterative), the two choose_type options (the edge-choosing strategies: classic and nearest neighbor), and the two datasets (StrategyQA and 2WikiMultiHop).

For each combination, we construct an ExperimentPipeline.
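
The sweep described above amounts to a Cartesian product over four parameter lists. The sketch below only illustrates that idea and is not the code in experiments.py: ExperimentConfig and build_grid are hypothetical names, and the real ExperimentPipeline constructor may take different arguments.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class ExperimentConfig:
    """Hypothetical container for one parameter combination."""
    dataset: str
    search_strategy: str
    choose_type: str
    k: int


def build_grid(k_values, choose_types, search_strategies, datasets):
    """Return one ExperimentConfig per combination of the four lists."""
    return [
        ExperimentConfig(dataset, strategy, choose_type, k)
        for dataset, strategy, choose_type, k in product(
            datasets, search_strategies, choose_types, k_values
        )
    ]


grid = build_grid(
    k_values=[3, 5],
    choose_types=["classic", "nearest_neighbor"],
    search_strategies=["simple", "iterative"],
    datasets=["strategyqa", "wikimultihop"],
)
print(len(grid))  # 2 * 2 * 2 * 2 = 16 combinations
```

In a real run, each configuration would presumably be handed to an ExperimentPipeline. The number of runs grows multiplicatively with every extra value in any list.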

Usage

To use the script, pass arguments on the command line. The available arguments are:

--k_values: List of integer values for k
--choose_types: List of choose types (nearest_neighbor, classic)
--search_strategies: List of search strategies (simple, iterative, none)
--datasets: List of dataset file paths (datasets/)
--num_todo: Integer representing the number of items to process from each dataset.
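
An argument list like the one above could be declared with Python's argparse module as follows. This is a sketch under assumptions, not the parser in experiments.py: marking every flag as required and restricting values with choices are illustrative decisions, and build_parser is a hypothetical helper name.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented flags (illustrative only)."""
    parser = argparse.ArgumentParser(description="Run KRAFT experiments (sketch).")
    # nargs="+" accepts one or more space-separated values per flag.
    parser.add_argument("--k_values", type=int, nargs="+", required=True)
    parser.add_argument(
        "--choose_types",
        nargs="+",
        choices=["nearest_neighbor", "classic"],
        required=True,
    )
    parser.add_argument(
        "--search_strategies",
        nargs="+",
        choices=["simple", "iterative", "none"],
        required=True,
    )
    parser.add_argument("--datasets", nargs="+", required=True)
    parser.add_argument("--num_todo", type=int, required=True)
    return parser


# Parse an explicit list so the sketch runs without real command-line input.
args = build_parser().parse_args(
    [
        "--k_values", "3", "5",
        "--choose_types", "classic",
        "--search_strategies", "simple", "iterative",
        "--datasets", "strategyqa",
        "--num_todo", "50",
    ]
)
print(args.k_values, args.search_strategies, args.num_todo)
```

With nargs="+", each list-valued flag collects its values into a Python list, which is why several k values or strategies can follow a single flag.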

Baseline

To run the no-retrieval GPT-3.5-Turbo Baseline, use k = 1, choos

Running the Script

Here's the basic syntax for running the script from your command line:

python experiments.py --k_values [K_VALUES] --choose_types [CHOOSE_TYPES] --search_strategies [SEARCH_STRATEGIES] --datasets [DATASETS] --num_todo [NUM_TODO]

Replace each bracketed placeholder with your own values, and replace experiments.py with the actual name of your script file if you have renamed it.

Examples

Basic Example: Run the script with a single value for k, one choose type, one search strategy, on a single dataset processing 50 items:

 python experiments.py --k_values 3 \
 --choose_types classic \
 --search_strategies simple \
 --datasets strategyqa \
 --num_todo 50

Multiple Parameters: Run the script with multiple k values, choose types, and search strategies on two datasets, processing 50 items each:

 python experiments.py --k_values 3 5 \
 --choose_types classic nearest_neighbor \
 --search_strategies simple iterative \
 --datasets strategyqa wikimultihop \
 --num_todo 50

Output

The script writes its results to files named after the dataset and the parameter combination. For example, if you run the script with a k value of 3, choose type classic, and search strategy simple on strategyqa.json, processing 50 items, the output file will be named results/strategyqa.json_simple_classic_3.json.
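
The naming pattern in that example can be reproduced with a small helper. The function below is a hypothetical sketch inferred from the single example filename above. The actual script may build the path differently.

```python
def result_path(dataset, search_strategy, choose_type, k, results_dir="results"):
    """Compose an output path as <dir>/<dataset>_<strategy>_<choose_type>_<k>.json.

    The field order is inferred from the README's example filename.
    """
    return f"{results_dir}/{dataset}_{search_strategy}_{choose_type}_{k}.json"


print(result_path("strategyqa.json", "simple", "classic", 3))
# results/strategyqa.json_simple_classic_3.json
```

Note that num_todo does not appear in the example filename, so under this pattern runs that differ only in the number of processed items would map to the same path.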

Notes

Ensure that the datasets are in the expected format and located in the specified paths.
The script can handle multiple datasets and parameter combinations, but be aware that processing times will increase with more complex configurations.

Real-time demo

Run streamlit run app.py to start the interactive demo.
