@rsbohn
Last active November 15, 2023 11:35
Cracking the ChatGPT Puzzlebox

I downloaded my ChatGPT chat history (conversations.json). Now what? Maybe the llm command-line tool can help.

schema please

jq '.[0]' conversations.json | llm "extract schema"

This gives me a schema for a single entry in the log (the first conversation). It's big!

what to index?

I want the 'title', 'conversation_id', and 'create_time'.

jq -c ".[] | {conversation_id, title, create_time}" conversations.json > index.jsonl
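
If jq isn't handy, roughly the same index can be built in Python. This is only a sketch: the function names `build_index` and `write_index` are made up for illustration, and it assumes conversations.json is a top-level JSON array (as the `.[]` filter above implies). Missing keys come out as null, same as in jq.

```python
import json


def build_index(conversations):
    """Yield one small record per conversation, with conversation_id first."""
    for conv in conversations:
        yield {
            "conversation_id": conv.get("conversation_id"),
            "title": conv.get("title"),
            "create_time": conv.get("create_time"),
        }


def write_index(src="conversations.json", dst="index.jsonl"):
    """Write the index as newline-delimited JSON, one record per line."""
    with open(src, encoding="utf-8") as f:
        conversations = json.load(f)
    with open(dst, "w", encoding="utf-8") as out:
        for record in build_index(conversations):
            out.write(json.dumps(record) + "\n")
```

Calling `write_index()` in the directory that holds conversations.json should produce an index.jsonl comparable to the jq output.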

I put 'conversation_id' first because llm embed-multi treats the first key of each record as the item ID.

next steps

  • use llm embed-multi to create embeddings for the index entries.
  • use llm similar to search the index.
  • write jq queries to extract an item by 'conversation_id'.
  • consider putting the whole log into SQLite (not sure about this one yet).
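
The last two items could be tried out together with a few lines of Python. This is a rough sketch, not a finished design: the table name `conversations`, the `raw` column, and the function names are all my own assumptions, and it swaps the jq lookup for a SQL query. Each conversation is stored whole as JSON text, with the three indexed fields pulled out as columns.

```python
import json
import sqlite3


def load_conversations(db, conversations):
    """Store each conversation as one row: three indexed fields plus the raw JSON."""
    db.execute(
        "CREATE TABLE IF NOT EXISTS conversations ("
        "conversation_id TEXT PRIMARY KEY, title TEXT, create_time REAL, raw TEXT)"
    )
    db.executemany(
        "INSERT OR REPLACE INTO conversations VALUES (?, ?, ?, ?)",
        (
            (c.get("conversation_id"), c.get("title"), c.get("create_time"), json.dumps(c))
            for c in conversations
        ),
    )
    db.commit()


def get_conversation(db, conversation_id):
    """Return the full conversation dict for an ID, or None if it isn't there."""
    row = db.execute(
        "SELECT raw FROM conversations WHERE conversation_id = ?",
        (conversation_id,),
    ).fetchone()
    return json.loads(row[0]) if row else None
```

To use it, open a database with `sqlite3.connect("conversations.db")`, pass it to `load_conversations` along with the list from `json.load` on conversations.json, then call `get_conversation` with an ID taken from index.jsonl.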