morphik-core/README.md

96 lines
5.7 KiB
Markdown
Raw Permalink Normal View History

2025-04-21 17:52:31 -07:00
<p align="center">
<img alt="Morphik Logo" src="assets/morphik_logo.png">
</p>
<p align="center">
<a href='http://makeapullrequest.com'><img alt='PRs Welcome' src='https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=shields'/></a>
<img alt="GitHub commit activity" src="https://img.shields.io/github/commit-activity/m/morphik-org/morphik-core"/>
<img alt="GitHub closed issues" src="https://img.shields.io/github/issues-closed/morphik-org/morphik-core"/>
<img alt="PyPI - Downloads" src="https://img.shields.io/pypi/dm/morphik">
<a href="https://discord.gg/BwMtv3Zaju"><img alt="Discord" src="https://img.shields.io/discord/1336524712817332276?logo=discord&label=discord"></a>
</p>
2025-03-25 20:10:07 -07:00
2025-04-21 17:52:31 -07:00
<!-- add a roadmap! - <a href="https://morphik.ai/roadmap">Roadmap</a> - -->
<!-- Add a changelog! - <a href="https://morphik.ai/changelog">Changelog</a> -->
2025-03-25 20:10:07 -07:00
2025-04-21 17:52:31 -07:00
<p align="center">
<a href="https://docs.morphik.ai">Docs</a> - <a href="https://discord.gg/BwMtv3Zaju">Community</a> - <a href="https://docs.morphik.ai/blogs/gpt-vs-morphik-multimodal">Why Morphik?</a> - <a href="https://github.com/morphik-org/morphik-core/issues/new?assignees=&labels=bug&template=bug_report.md">Bug reports</a>
</p>
2025-02-07 20:17:26 -05:00
2025-04-21 17:52:31 -07:00
## Morphik is an alternative to traditional RAG for highly technical and visual documents.
2025-03-04 21:30:43 -05:00
2025-04-21 17:52:31 -07:00
[Morphik](https://morphik.ai) provides developers the tools to ingest, search (deep and shallow), transform, and manage unstructured and multimodal documents. Some of our features include:
2025-01-06 14:10:12 -05:00
2025-04-21 17:52:31 -07:00
- [Multimodal Search](https://docs.morphik.ai/concepts/colpali): We employ techniques such as ColPali to build search that actually *understands* the visual content of documents you provide. Search over images, PDFs, videos, and more with a single endpoint.
- [Knowledge Graphs](https://docs.morphik.ai/concepts/knowledge-graphs): Build knowledge graphs for domain-specific use cases in a single line of code. Use our battle-tested system prompts, or use your own.
- [Fast and Scalable Metadata Extraction](https://docs.morphik.ai/concepts/rules-processing): Extract metadata from documents - including bounding boxes, labeling, classification, and more.
- [Integrations](https://docs.morphik.ai/integrations): Integrate with existing tools and workflows. Including (but not limited to) Google Suite, Slack, and Confluence.
- [Cache-Augmented-Generation](https://docs.morphik.ai/python-sdk/create_cache): Create persistent KV-caches of your documents to speed up generation.
2025-03-20 22:54:18 -04:00
2025-04-21 17:52:31 -07:00
The best part? Morphik has a [free tier](https://www.morphik.ai/pricing) and is open source! Get started by signing up at [Morphik](https://www.morphik.ai/signup).
2025-03-20 22:54:18 -04:00
2025-04-21 17:52:31 -07:00
## Table of Contents
- [Getting Started with Morphik](#getting-started-with-morphik-recommended)
- [Self-hosting the open-source version](#self-hosting-the-open-source-version)
- [Using Morphik](#using-morphik)
- [Contributing](#contributing)
- [Open source vs paid](#open-source-vs-paid)
2025-04-19 22:43:28 -07:00
2025-04-21 17:52:31 -07:00
## Getting Started with Morphik (Recommended)
2025-04-19 22:43:28 -07:00
2025-04-21 17:52:31 -07:00
The fastest and easiest way to get started with Morphik is by signing up for free at [Morphik](https://www.morphik.ai/signup). Your first 200 pages and 100 queries are on us! After this, you can pay based on usage with discounted rates for heavier use.
2025-03-20 22:54:18 -04:00
2025-04-21 17:52:31 -07:00
## Self-hosting the open-source version
2025-01-07 00:45:28 -05:00
2025-04-26 03:45:10 +02:00
If you'd like to self-host Morphik, you can find the dedicated instruction [here](https://docs.morphik.ai/getting-started). We offer options for direct installation and installation via docker.
2025-01-07 00:45:28 -05:00
2025-04-21 17:52:31 -07:00
**Important**: Due to limited resources, we cannot provide full support for open-source deployments. We have an installation guide, and a [Discord community](https://discord.gg/BwMtv3Zaju) to help, but we can't guarantee full support.
2025-01-07 00:45:28 -05:00
2025-04-21 17:52:31 -07:00
## Using Morphik
2025-01-06 14:10:12 -05:00
2025-04-21 17:52:31 -07:00
Once you've signed up for Morphik, you can get started with ingesting and search your data right away.
2025-01-06 14:10:12 -05:00
2025-04-21 17:52:31 -07:00
### Code (Example: Python SDK)
For programmers, we offer a [Python SDK](https://docs.morphik.ai/python-sdk/morphik) and a [REST API](https://docs.morphik.ai/api-reference/health-check). Ingesting a file is as simple as:
2025-03-20 22:54:18 -04:00
```python
from morphik import Morphik
2025-03-20 22:54:18 -04:00
2025-04-21 17:52:31 -07:00
morphik = Morphik("<your-morphik-uri>")
morphik.ingest_file("path/to/your/super/complex/file.pdf")
2025-03-20 22:54:18 -04:00
```
2025-04-21 17:52:31 -07:00
Similarly, searching and querying your data is easy too:
2025-03-20 22:54:18 -04:00
```python
2025-04-21 17:52:31 -07:00
morphik.query("What's the height of screw 14-A in the chair assembly instructions?")
2025-03-20 22:54:18 -04:00
```
2025-04-21 17:52:31 -07:00
### Morphik Console
2025-03-20 22:54:18 -04:00
2025-04-21 17:57:02 -07:00
You can also interact with Morphik via the Morphik Console. This is a web-based interface that allows you to ingest, search, and query your data. You can upload files, connect to different data sources, and chat with your data all within the same place.
2025-03-20 22:54:18 -04:00
2025-04-21 17:57:02 -07:00
### Model Context Protocol
Finally, you can also access Morphik via MCP. Instructions are available [here](https://docs.morphik.ai/using-morphik/mcp).
2025-03-20 22:54:18 -04:00
2025-01-06 14:10:12 -05:00
2025-04-21 17:52:31 -07:00
## Contributing
You're welcome to contribute to the project! We love:
- Bug reports via [GitHub issues](https://github.com/morphik-org/morphik-core/issues)
- Feature requests via [GitHub issues](https://github.com/morphik-org/morphik-core/issues)
- Pull requests
2024-11-25 18:10:13 -05:00
2025-04-29 08:00:17 +02:00
Currently, we're focused on improving speed, integrating with more tools, and finding the research papers that provide the most value to our users. If you have thoughts, let us know in the discord or in GitHub!
2024-11-25 18:10:13 -05:00
2025-04-21 17:52:31 -07:00
## Open source vs paid
2024-11-25 18:10:13 -05:00
2025-04-21 17:52:31 -07:00
Certain features - such as Morphik Console - are not available in the open-source version. Any feature in the `ee` namespace is not available in the open-source version and carries a different license. Any feature outside that is open source under the MIT expat license.
2024-11-25 18:10:13 -05:00
2025-04-21 17:52:31 -07:00
## Contributors
2024-11-25 18:10:13 -05:00
2025-04-21 17:52:31 -07:00
Visit our special thanks page dedicated to our contributors [here](https://docs.morphik.ai/special-thanks).
2025-04-21 21:53:54 -07:00
## PS
We took inspiration from [PostHog](https://posthog.com) while writing this README. If you're from PostHog, thank you ❤️