Skip to content

devlucassantos/vnc-pdf-content-extractor-api

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

vnc-pdf-content-extractor-api

🌍 EnglishPortuguês

vnc-pdf-content-extractor-api is the service responsible for extracting content from the PDFs used by the Você na Câmara (VNC) platform. In this repository, you will find the source code for the PDF content extractor API, which uses technologies such as Python and FastAPI. Additionally, the Docker container responsible for running this code is available, allowing you to execute the project quickly and easily.

How to run

Running via Docker

To run the API, you will need to have Docker installed on your machine and run the following command in the root directory of this project:

docker compose up --build

Documentation

After running the project, all the available routes for accessing the API can be found through the link:

http://localhost:8084/api/documentation

image

Você na Câmara

Você na Câmara (VNC) is a news platform developed to simplify and make accessible the legislative propositions being processed in the Chamber of Deputies of Brazil. Through the use of Artificial Intelligence, the platform synthesizes the content of these legislative documents, transforming technical and complex information into clear and objective summaries for the general public.

This project is part of the Final Paper of the platform's developers and was conceived based on architectures such as hexagonal and microservices. The solution was organized into several repositories, each with specific responsibilities within the system:

  • vnc-databases: Responsible for managing the platform's data infrastructure. Main technologies used: PostgreSQL, Redis, Liquibase, and Docker.
  • vnc-pdf-content-extractor-api: Responsible for extracting content from the PDFs used by the platform. Main technologies used: Python, FastAPI, and Docker.
  • vnc-domains: Responsible for centralizing the platform's domains and business logic. Main technology used: Go.
  • vnc-summarizer: Responsible for the software that extracts data and summarizes the propositions available on the platform. Main technologies used: Go, PostgreSQL, Amazon Web Services (AWS), and Docker.
  • vnc-api: Responsible for providing data to the platform's frontend. Main technologies used: Go, Echo, PostgreSQL, Redis, and Docker.
  • vnc-web-ui: Responsible for providing the platform's web interface. Main technologies used: TypeScript, SCSS, React, Vite, and Docker.

About

Responsible for extracting content from the PDFs used by the Você na Câmara (VNC) platform. Main technologies used: Python, FastAPI, and Docker.

Topics

Resources

License

Stars

Watchers

Forks

Contributors