leadershop

πŸŽ“ marksheet-information-extraction-api - Extract Information Effortlessly from Marksheet Images

Download

πŸ‘‹ Introduction

Welcome to the marksheet-information-extraction-api! This is a powerful backend service built with FastAPI. It helps you extract structured information from academic marksheets, whether they are in image or PDF format. The service uses Optical Character Recognition (OCR) combined with a large language model (LLM) to give you accurate results in JSON format, along with confidence scores for each piece of information.

πŸš€ Getting Started

Here’s how you can easily get started:

  1. Download the Application
    • To begin, you will need to download the application. Just visit the Releases page to get the latest version.
  2. Install Required Software
    • You may need to install additional software to run the application smoothly. Typically, you should have:
      • Python 3.6 or newer installed on your machine.
      • Docker, if you prefer to run the application in a containerized environment.
  3. Setting Up the Environment
    • If you are using Python, create a virtual environment for easy management of dependencies:
      python -m venv myenv
      source myenv/bin/activate  # On Windows use: myenv\Scripts\activate
      
    • Then, install the required packages:
      pip install -r requirements.txt
      
  4. Run the Application
    • If you downloaded a standalone version:
      • Follow the provided instructions in the zip or installer package.
    • If you are using Docker, you can run the following command:
      docker-compose up
      
  5. Accessing the API
    • By default, the API runs on http://localhost:8000. Open this address in your web browser to check its status and see the documentation.

πŸ“¦ Download & Install

You can get the latest version of the marksheet-information-extraction-api from the Releases page. Follow the instructions on that page to download the appropriate file for your operating system.

Once you have downloaded the application, follow the setup instructions outlined in the β€œGetting Started” section.

πŸ” Features

πŸ’» System Requirements

To run the application smoothly, ensure your system meets the following requirements:

πŸ”§ API Documentation

Once the API is running, you can access its documentation at http://localhost:8000/docs. This interface will guide you through the available endpoints, request parameters, and response formats.

πŸ“„ Example Usage

To use the API, send a POST request to the /extract endpoint with your marksheet image or PDF. You can use tools like Postman or curl for this purpose.

Example using curl:

curl -X POST "http://localhost:8000/extract" -F "file=@path_to_your_marksheet.pdf"

The API will return a JSON response with extracted data and confidence scores.

πŸ‘₯ Community & Support

If you have questions or need help, feel free to open an issue in the GitHub repository. Our community and maintainers will be happy to assist you.

🚧 Known Issues

πŸŽ‰ Acknowledgements

Thanks to the contributors of FastAPI, EasyOCR, and everyone involved in the development of this project. We appreciate your support!

Download