PLEASE FILL IN THE BUG REPORT BELOW ENSURING ALL CHECKLIST ITEMS ABOVE HAVE BEEN CONSIDERED.
Bug Description
Values in an output column have inconsistent formats even when extract_quotes is set to False. See the screenshot below.
Expected Behavior
Values in an output column have the consistent format.
Steps to Reproduce
Environment Information
Operating System:
Python Version:
3.10
Package Versions:
1.1.3
Error Messages and Logs
N/A
Screenshots
Minimal Reproduction Example
# Minimal code to reproduce the bug
text_df = pd.read_csv(args.text_filepath, sep=",", quotechar='"')
prompt = "Analyze the movie description and extract the director name."
text_input_cols = ["text"]
text_output_cols = {
"director": "The director of the movie",
}
processed_text_df = text_df.sem_extract(
text_input_cols,
text_output_cols,
# extract_quotes=False,
# return_raw_outputs=False,
)
print(processed_text_df.head())
Link to download the CSV file.
Additional Context
Checklist
PLEASE FILL IN THE BUG REPORT BELOW ENSURING ALL CHECKLIST ITEMS ABOVE HAVE BEEN CONSIDERED.
Bug Description
Values in an output column have inconsistent formats even when
extract_quotesis set to False. See the screenshot below.Expected Behavior
Values in an output column have the consistent format.
Steps to Reproduce
Environment Information
Operating System:
Python Version:
3.10
Package Versions:
1.1.3
Error Messages and Logs
N/A
Screenshots
Minimal Reproduction Example
Link to download the CSV file.
Additional Context
Checklist