Parseta helps extract, analyze, and process data from complex documents, images, PDFs and more with advanced AI capabilities.
- Advanced OCR & Processing: Extract text, tables, and handwriting from any document with high precision
- Auto-schema Generation: Automatically detect and adapt to document structures
- Custom Actions: Create tailored workflows with automated task processing
- Local LLM Support: Supports Local LLMs like Llama, Mistral also supports OpenAI vision models
- Sync with your company data: Clone sensitive data while maintaining privacy
- Open Source: Paresta is a AGPLv3 licensed open source project
- Run the docker compose file
- Configure the environment variables (Refer to mockenv file)
- Start extracting data from your documents!
Visit our documentation for:
- Detailed integration guides
- API reference
- Best practices
- Example implementations
- Privacy controls configuration
We love contributions! If you'd like to contribute:
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the AGPLV3 License - see the LICENSE file for details.
Note: This project is a work in progress and is not yet ready for production use. We are actively working on it and will update this README as we make progress.
Built with ❤️ for the AI community