684 Commits

Author SHA1 Message Date
Simón Fishman
e0baca7fbf
Rename third party examples folder (#779) 2023-10-11 19:30:35 -07:00
Simón Fishman
e3d68f8843
Fix titles for wandb notebooks (#778) 2023-10-11 19:21:52 -07:00
simonpfish
b213d75bcd Move and rename embeddings W&B example 2023-10-11 19:12:51 -07:00
gusmally
a6100e80ff
Correct legacy fine-tuning note (#770) 2023-10-11 14:53:36 -07:00
simonpfish
f52ffdaca4 Fix more uneven title sizing 2023-10-06 17:14:58 -07:00
simonpfish
056ba72710 Fix uneven title sizing 2023-10-06 17:13:47 -07:00
Emil Sedgh
f05c2c9f8b
Consider function calling roles and messages valid (#765) 2023-10-06 17:12:04 -07:00
Kai Chen
2df818aca3
Fix link to .py file (#763) 2023-10-06 17:11:08 -07:00
Anish Shah
5591ff376f
Add Weights and Biases OpenAI MLOps examples to third_party_examples (#714) 2023-10-02 14:02:34 -07:00
Fayaz Rahman
24b7a8e9b9
Add Deep Lake vector database example (#455) 2023-10-02 11:20:03 -07:00
Daniel
b01738fa1c
Add Neon Postgres to the list of vector databases in the README (#746) 2023-09-29 18:23:01 -07:00
Daniel
63e966d69a
Update Neon cookbook README.md (#747) 2023-09-29 18:22:39 -07:00
Stefano Lottini
1ca286c180
new cassIO connect experience with the newest cassio.init (#745) 2023-09-28 18:00:04 -07:00
Jiří Hofman
8d329cf9a3
Fix typo in How_to_stream_completions.ipynb (#744)
Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-09-28 17:53:36 -07:00
Dhruv Anand
f6260a013e
fix: typo in hallucination reduction stat (#742) 2023-09-28 17:50:32 -07:00
Daniel
c3f5e0cd7c
Add Neon Postgres vector database OpenAI cookbook (#690)
Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-09-28 17:48:41 -07:00
Ikko Eltociear Ashimine
6efa2eca0b
Fix typo in Backtranslation_of_SQL_queries.py (#709) 2023-09-27 17:50:53 -07:00
Simón Fishman
c2959fd60b
[tiktoken_counting] fix tokenizer name (#741) 2023-09-27 16:12:31 -07:00
Saarika Bhasi
4631e1b74a
[elasticsearch] fix typo in signup url (#726)
Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-09-27 16:02:50 -07:00
Albarqawi
bc78c9871c
Add a notebook for end to end automation example (#527) 2023-09-27 15:59:29 -07:00
Farzad Sunavala
552262ea89
Minor change to use SearchIndexingBufferedSender to support optimized batch indexing (#712) 2023-09-26 16:43:05 -07:00
ridrisa
c777f1025a
fix minor typo in financial_document_analysis_with_llamaindex (#733) 2023-09-26 16:25:55 -07:00
Chuong Ho
561efac6d3
Add Mongodb Atlas Vector Search (#734) 2023-09-26 16:22:58 -07:00
Simón Fishman
222a85fb17
Improve parallel processing script (#735) 2023-09-25 12:09:39 -07:00
gusmally
fde2a6474d
add note about legacy fine tuning (#729) 2023-09-22 10:49:07 -07:00
Stefano Lottini
ba1e6004c9
Fix spelling error + misplaced copypaste in the CQL notebook (#728) 2023-09-22 10:48:55 -07:00
Cathy Chen
8ab41ac370
Add description and rename Obtain_dataset, and throw error when fine tuned model not available (#727) 2023-09-20 14:50:59 -07:00
Will DePue
efcc78953d
Improve code_search and get_embedding notebooks. (#717)
Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-09-15 16:54:29 -07:00
Nirant
fd4e31bb00
5x Error Reduction in RAG with gpt-3.5-turbo-0613 Finetuning (#678) 2023-09-12 09:09:23 +01:00
Farzad Sunavala
cb46cc46a6
Add Azure Cognitive Search vector database (#569)
* get started azuresearch

* master docs reference

* pablo comments

* replace placeholders with "_"

* azs latest sdk updates

* deep dive link

* openai feedback

* not print the whole vector in console

* cleanup outputs

* cleanup outputs pip
2023-09-11 16:34:15 -07:00
dongqqcom
d86a0381e6
Add Tair to examples of vector database (#609)
* Add Tair to examples of vector database

* remove hardcoding API keys

* Update examples/vector_databases/tair/Getting_started_with_Tair_and_OpenAI.ipynb

Co-authored-by: Simón Fishman <simonpfish@gmail.com>

* fix: input api key by getpass

* update: input api key by getpass

* update: Adding output of code cells

---------

Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-09-11 15:16:00 -07:00
Krista Pratico
240694fa32
[azure] bring your own data example (#654)
* initial commit for bring your own data notebook

* fix words

* add version

* add output from stream

* address review feedback

* edit intro
2023-09-11 15:12:37 -07:00
Puneet Dhiman
e4d0bc169c
Correct spelling mistake in Search_reranking_with_cross-encoders.ipynb (#675) 2023-09-11 15:10:17 -07:00
Stefano Lottini
59011825c3
Updates to CassIO notebook (#684) 2023-09-11 14:59:09 -07:00
Steven Pousty
857b592731
small typoe fix (#688) 2023-09-11 14:57:09 -07:00
Simón Fishman
5783656852
add descriptions to fine-tuning dataprep notebook (#673) 2023-09-02 08:26:55 -07:00
Simón Fishman
15f3fda4a3
clarify title (#672) 2023-09-01 10:14:11 -07:00
Will DePue
76448de0de
Update Customizing_embeddings.ipynb to be delete long cache output. (#667)
* patch

* remove print statement

---------

Co-authored-by: simonpfish <simonpfish@gmail.com>
2023-08-29 17:50:15 -07:00
Stefano Lottini
fae14ddb89
Add Cassandra/Astra DB to the vector databases README (#665)
* Add Cassandra/Astra DB to the vector databases overall README

* Linking to a basic quickstart instead
2023-08-29 15:20:51 -07:00
Liam Thompson
31b4de22a3
Add elasticsearch examples to vector databases folder (#622)
* Add Elasticsearch to vector databases, add notebooks

* Update prompt

* Make intro verbiage more neutral

* Add semantic search notebook outputs

* Add RAG notebook output

* Update query

* Remove unreadable vector output
2023-08-29 10:54:08 -07:00
Stefano Lottini
4d330b82d7
Add Astra DB (/Cassandra) to vector databases with example notebooks (#655)
* first commit for the Astra DB / Cassandra notebooks

* add json with quotes

* towards the final cassIO pilot notebook

* small changes to the copy

* cassIO pilot completed and its readme done

* fix silly markdown error

* fix silly markdown error 2

* astra vector link change and vector search picture improved

* added link to docs for connecting to Cassandra cluster

* CQL version of the flow

* revised readme

* final adjustments around metrics/distances

* links' final version assumes files in openai's main branch

* Add ref to QA+vector general guide; fix prompt; clarified conclusion paragraph; typos
2023-08-29 10:27:49 -07:00
recordcrash
1945bfe65c
Fix UTF-8 encoding in Chat_finetuning_data_prep.ipynb (#648) 2023-08-28 18:12:30 -07:00
Eliah Kagan
63f95154b1
Add Tiktokenizer link in "How to count tokens" (#604)
This adds a link to Tiktokenizer webapp as another tool, in
addition to the OpenAI Tokenizer.
2023-08-28 10:28:19 -07:00
Viet Hoang Tran Duong
1ae3bf631b
fix undefined variables in fine tune example (#660)
`create_user_message` and `test_df` are not defined.
2023-08-25 13:03:06 -07:00
Simón Fishman
35b7123faf
update titles (#653) 2023-08-22 16:32:43 -07:00
Simón Fishman
d534c85477
more fine-tuning improvements (#652)
* more fine-tuning improvements

* add links to other resources
2023-08-22 16:27:08 -07:00
Simón Fishman
8ed84645e8
small improvements to the fine-tuning cookbook (#651) 2023-08-22 15:30:03 -07:00
simonpfish
cbe292bd93 update fine-tuning cookbook 2023-08-22 13:48:39 -07:00
Michael Wu
a173325830
add ft data prep notebook (#647) 2023-08-22 12:24:42 -07:00
colin-openai
524949f9d1
Pushing cookbook for fine-tuning via ChatCompletion (#646)
* Pushing cookbook for fine-tuning via ChatCompletion

* Add correct file

* Fixed bug with create_prompt function and refactored files
2023-08-22 12:24:22 -07:00