22 Commits

Author SHA1 Message Date
Adityavardhan Agrawal
1792275cb8
Format fix, UI package update (#100)
Co-authored-by: Arnav Agrawal <aa779@cornell.edu>
2025-04-20 16:34:29 -07:00
Arnav Agrawal
3c1195e001
add task queue (#87)
* add task queue

* ensure task queuing is working as expected.

* add downstream sdk changes

* bugs and pr comments

* update docker arq running logic
2025-04-16 02:31:49 -04:00
Adityavardhan Agrawal
7138094f32
Refactor Python SDK: Introduce Morphik SDK, replace DataBridge references (#75) 2025-04-09 18:46:00 -07:00
Adityavardhan Agrawal
490f342407
Add litellm support across the system (#74) 2025-04-08 00:19:47 -07:00
Adityavardhan Agrawal
f3a0ea7876
Add update graphs, and custom open ai url (#63) 2025-03-29 23:22:47 -07:00
Arnav Agrawal
821e9d7e20
Add support for ColPali (#43)
* debug mps not supported

* further debug (i think i lost some braincells)

* fix mps bug and resolve dependency issues

* remove libmagic dependence

* add colpali embedding model

* multi-vector store works - verified with testing

* add integration testing

* support text embedding in colpali

* complete colpali integration and testing

* formatting + some PR comments

* remove experimental files

* resolve PR comments
2025-02-26 20:17:12 -05:00
Adityavardhan Agrawal
c8ed46b12b
Separate parsing and chunking into different functions for easy rules processing (#41) 2025-02-15 13:02:15 -05:00
Adityavardhan Agrawal
20c3015038 Fix video parsing bugs, improve server logging 2025-01-30 16:03:46 -05:00
Arnav Agrawal
48e6aeb8b7
use local unstructured by default (#12) 2025-01-01 09:18:23 -05:00
Arnav Agrawal
abccf99974
add contextual embedding with claude prompt caching (#11)
* add context augmentation while chunking

* add contextual embeddings

* default config should be combined

* fix comments on PR

* update example environment

* update config and api to support env-variable optionality
2024-12-31 06:58:34 -05:00
Arnav Agrawal
3ad55129b7
Rename imports (#9) 2024-12-30 11:58:53 -05:00
Arnav Agrawal
0e4a43645a reformat files 2024-12-29 12:48:41 +05:30
Arnav Agrawal
80db083471 added frame/transcript augmentation for video retrieval 2024-12-29 12:45:12 +05:30
Arnav Agrawal
7830b42c6b fix typing errors 2024-12-29 11:10:51 +05:30
Arnav Agrawal
196655fea3 pipethrough video timestamps on query 2024-12-28 19:41:05 +05:30
Arnav Agrawal
13ab54fbf8
add a video parser + formatting changes (#4) 2024-12-26 11:34:24 -05:00
Arnav Agrawal
4f2f221d40 bug fixes and end-to-end testing 2024-12-17 21:40:38 -05:00
Adityavardhan Agrawal
df8d7fcdd0
refactor some stuff (#2)
* refactor some stuff, remove bare try catches
2024-12-15 14:31:25 -05:00
Arnav Agrawal
000887a4dc pass all tests apart from querying 2024-11-28 19:09:40 -05:00
Adityavardhan Agrawal
983a4ee854 separate text and doc ingestion pathways 2024-11-24 14:29:25 -05:00
Adityavardhan Agrawal
d70f53cf86 system changes 2024-11-22 20:58:17 -05:00
Adityavardhan Agrawal
1a926c7be0 restructuring and WIP api and sdk changes 2024-11-16 01:48:15 -05:00