68 Commits

Author SHA1 Message Date
Adityavardhan Agrawal
df3a40583a Add app id scoping for graphs and folders 2025-05-03 18:48:11 -07:00
Adityavardhan Agrawal
1897485c87
Add app_id scoping and fix raw image extraction (#129) 2025-05-03 12:29:44 -07:00
Arnav Agrawal
5538b4787a
Deep research (#126) 2025-05-01 17:02:22 -07:00
Adityavardhan Agrawal
de1a7d2fd7
Staged Rule Execution (#111) 2025-04-23 23:15:03 -07:00
Adityavardhan Agrawal
792f082e05
Add structured output to completion response for query (#108) 2025-04-23 10:39:06 -07:00
Adityavardhan Agrawal
1792275cb8
Format fix, UI package update (#100)
Co-authored-by: Arnav Agrawal <aa779@cornell.edu>
2025-04-20 16:34:29 -07:00
Arnav Agrawal
c56f66349e
switch to decorator pattern for telemetry (#99) 2025-04-19 19:13:51 -04:00
Adityavardhan Agrawal
09622cc3fc
Fix pytest to use redis queue, bug fixes (#98) 2025-04-19 18:24:53 -04:00
Adityavardhan Agrawal
6fcb130d58
UI improvements: direct chat with docs (#97) 2025-04-18 23:11:48 -07:00
Adityavardhan Agrawal
5b06bfa38e
Chat sources, improve logger, UI improvements (#94) 2025-04-18 00:26:27 -07:00
Arnav Agrawal
25e8b8b8e9
Folder extraction (#92) 2025-04-17 20:52:18 -07:00
Arnav Agrawal
4ffe0b3bac
Add pooling changes for scalable ingestion (#90) 2025-04-16 21:45:34 -07:00
Adityavardhan Agrawal
be96caa303
Add folders table, add to document and other methods (#88) 2025-04-16 01:16:54 -07:00
Arnav Agrawal
3c1195e001
add task queue (#87)
* add task queue

* ensure task queuing is working as expected.

* add downstream sdk changes

* bugs and pr comments

* update docker arq running logic
2025-04-16 02:31:49 -04:00
Adityavardhan Agrawal
75556c924a
Add folders and user scopes (#82) 2025-04-13 14:52:26 -07:00
Arnav Agrawal
1f3df392da
billing changes for paid tiers (#83)
* billing changes for paid tiers

* PR comments

* initialize userdb on startup if in cloud mode
2025-04-13 14:11:12 -07:00
Adityavardhan Agrawal
7138094f32
Refactor Python SDK: Introduce Morphik SDK, replace DataBridge references (#75) 2025-04-09 18:46:00 -07:00
Adityavardhan Agrawal
490f342407
Add litellm support across the system (#74) 2025-04-08 00:19:47 -07:00
Arnav Agrawal
79865f0bd1 initialize vector store before multivector store 2025-04-06 12:48:22 -07:00
Adityavardhan Agrawal
5ae396cddb Squashed changes from hosted-service 2025-04-03 12:54:54 -07:00
Adityavardhan Agrawal
bf7c90164f change document endpoint to POST, and adjust API calls for consistency 2025-04-03 10:47:34 -07:00
Adityavardhan Agrawal
08893733f6
Add custom prompt and example injections for query and graph creation (#68) 2025-03-31 21:30:48 -07:00
Adityavardhan Agrawal
f3a0ea7876
Add update graphs, and custom open ai url (#63) 2025-03-29 23:22:47 -07:00
Adityavardhan Agrawal
9ce0507616
Add delete document endpoint (#62) 2025-03-29 18:42:52 -07:00
Adityavardhan Agrawal
6ef3ec207e Reduce extra logging, change to debugs 2025-03-27 20:05:27 -07:00
Adityavardhan Agrawal
7eb5887d2f
Add hosted tier limits, cloud uri gen (#59) 2025-03-27 17:30:02 -07:00
Adityavardhan Agrawal
adc0b2dbb8
Add batch ingestion (#55) 2025-03-18 23:27:53 -04:00
Adityavardhan Agrawal
4ae132ff46
Implement knowledge graphs, and graph enhanced querying (#48) 2025-03-17 17:36:43 -04:00
Adityavardhan Agrawal
32a5d787fe
Add update methods with add update strategy (#53) 2025-03-13 11:26:01 -04:00
Adityavardhan Agrawal
38683df0f3
Add completion sources and batch retrieval for docs and chunks (#51) 2025-03-09 18:42:04 -04:00
LukeZekes
e56691a1c5
add filename option for text documents (#47) 2025-03-05 10:56:02 -05:00
Arnav Agrawal
821e9d7e20
Add support for ColPali (#43)
* debug mps not supported

* further debug (i think i lost some braincells)

* fix mps bug and resolve dependency issues

* remove libmagic dependence

* add colpali embedding model

* multi-vector store works - verified with testing

* add integration testing

* support text embedding in colpali

* complete colplai integration and testing

* formatting + some PR comments

* remove experimental files

* resolve PR comments
2025-02-26 20:17:12 -05:00
Adityavardhan Agrawal
c8ed46b12b
Separate parsing and chunking into different function for easy rules processing (#41) 2025-02-15 13:02:15 -05:00
Adityavardhan Agrawal
a46fa064c7
Add natural language rules based ingestion (#34) 2025-02-07 21:08:40 -05:00
Arnav Agrawal
d124e6aa0d
Add support for cache-augmented-generation (#30) 2025-01-28 23:49:28 -05:00
Adityavardhan Agrawal
f4c14fc71b
Streamline dev experience with optional auth and simplified config (#27) 2025-01-11 11:24:00 -05:00
Adityavardhan Agrawal
0a933e5fd6
Add docker support (#24) 2025-01-09 05:17:25 -05:00
Arnav Agrawal
f72f6f0249
Config improvements (#17) 2025-01-07 01:42:10 -05:00
Arnav Agrawal
c3726504f7
add support for PostgreSQL and pgvector (#15)
Co-authored-by: Adityavardhan Agrawal <aa729@cornell.edu>
2025-01-04 08:14:52 -05:00
Adityavardhan Agrawal
273dfcc5e7
Add PostgreSQL support (#13)
Co-authored-by: Arnav Agrawal <aa779@cornell.edu>
2025-01-04 08:11:09 -05:00
Arnav Agrawal
20faae8903
Add reranking (#14) 2025-01-02 03:42:47 -05:00
Arnav Agrawal
48e6aeb8b7
use local unstructured by default (#12) 2025-01-01 09:18:23 -05:00
Arnav Agrawal
abccf99974
add contextual embedding with claude prompt caching (#11)
* add context augmentation while chunking

* add contextual embeddings

* default config should be combined

* fix comments on PR

* update example environment

* update config and api to support env-variable optionality
2024-12-31 06:58:34 -05:00
Adityavardhan Agrawal
367dc079e8
Add local file system for storage (#10) 2024-12-31 06:25:51 -05:00
Adityavardhan Agrawal
3e4a9999ad
Add open telemetry and shell (#5) 2024-12-30 23:52:25 -05:00
Arnav Agrawal
0e4a43645a reformat files 2024-12-29 12:48:41 +05:30
Arnav Agrawal
16e5decc4b fix linting issues 2024-12-28 17:29:33 +05:30
Arnav Agrawal
b883f52a11 add ollama embeddings and test them out 2024-12-27 12:17:16 +05:30
Arnav Agrawal
418054e9a3 update configuration style to support easy model editing 2024-12-27 11:19:07 +05:30
Arnav Agrawal
13ab54fbf8
add a video parser + formatting changes (#4) 2024-12-26 11:34:24 -05:00