Skip to content

Open-source document processing platform built for knowledge workers

License

Notifications You must be signed in to change notification settings

harishdeivanayagam/parseta

Repository files navigation

Parseta

Open-source document processing platform built for knowledge workers

👋 Welcome to Parseta

Parseta helps extract, analyze, and process data from complex documents, images, PDFs and more with advanced AI capabilities.

✨ Features

📄 Document Processing

  • Advanced OCR & Processing: Extract text, tables, and handwriting from any document with high precision
  • Auto-schema Generation: Automatically detect and adapt to document structures
  • Custom Actions: Create tailored workflows with automated task processing

🔒 Privacy & Security

  • Local LLM Support: Supports Local LLMs like Llama, Mistral also supports OpenAI vision models
  • Sync with your company data: Clone sensitive data while maintaining privacy
  • Open Source: Paresta is a AGPLv3 licensed open source project

🚀 Getting Started

  1. Run the docker compose file
  2. Configure the environment variables (Refer to mockenv file)
  3. Start extracting data from your documents!

📚 Documentation

Visit our documentation for:

  • Detailed integration guides
  • API reference
  • Best practices
  • Example implementations
  • Privacy controls configuration

🤝 Contributing

We love contributions! If you'd like to contribute:

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

📝 License

This project is licensed under the AGPLV3 License - see the LICENSE file for details.

💬 Community & Support


Note: This project is a work in progress and is not yet ready for production use. We are actively working on it and will update this README as we make progress.


Built with ❤️ for the AI community