Coding Global

SmartPDFSearch

Archived 6 months ago
Created 7 months ago
Updated 7 months ago
Rate my project

📄 Simple PDF Search Engine
A simple PDF search engine that uses TF-IDF and cosine similarity to find the paragraphs most relevant to a user query.
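
For readers new to the two techniques, here is one common textbook formulation. The notation is introduced here for illustration and is not taken from the project: $t$ is a term, $p$ a paragraph, $N$ the number of paragraphs, $\mathrm{df}(t)$ the number of paragraphs containing $t$, and $\mathbf{q}$, $\mathbf{p}$ the TF-IDF vectors of the query and a paragraph. Libraries such as scikit-learn use smoothed and normalized variants, so exact scores will likely differ.

```latex
\mathrm{tfidf}(t, p) = \mathrm{tf}(t, p) \cdot \log\frac{N}{\mathrm{df}(t)}
\qquad
\cos(\mathbf{q}, \mathbf{p}) = \frac{\mathbf{q} \cdot \mathbf{p}}{\lVert \mathbf{q} \rVert \, \lVert \mathbf{p} \rVert}
```

A term that appears in few paragraphs gets a large weight, and a paragraph ranks highly when its weighted terms point in the same direction as the query's.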

🚀 Project Idea
This program:
1. Reads all PDF files from a specified folder.
2. Extracts text and splits it into paragraphs.
3. Indexes the paragraphs and converts them into numerical vectors.
4. Searches for the closest matching paragraphs to the user query.
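
The four steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names (`split_paragraphs`, `load_paragraphs`, `search`), the blank-line splitting rule, the `min_chars` threshold, and the `top_k` parameter are all assumptions made for this example. It relies on `pdfplumber` and `scikit-learn`, the two libraries the project lists as requirements.

```python
import re
from pathlib import Path

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def split_paragraphs(text, min_chars=20):
    """Step 2 (assumed rule): split on blank lines, drop very short fragments."""
    parts = re.split(r"\n\s*\n", text)
    # Collapse internal whitespace so wrapped lines become a single string.
    cleaned = [" ".join(part.split()) for part in parts]
    return [p for p in cleaned if len(p) >= min_chars]


def load_paragraphs(folder):
    """Steps 1-2: read every PDF in `folder`, return (filename, paragraph) pairs."""
    import pdfplumber  # imported lazily so `search` can be tried without PDFs

    items = []
    for pdf_path in sorted(Path(folder).glob("*.pdf")):
        with pdfplumber.open(pdf_path) as pdf:
            # extract_text() may return None for pages without a text layer.
            text = "\n\n".join(page.extract_text() or "" for page in pdf.pages)
        items.extend((pdf_path.name, para) for para in split_paragraphs(text))
    return items


def search(paragraphs, query, top_k=3):
    """Steps 3-4: build TF-IDF vectors, rank paragraphs by cosine similarity."""
    vectorizer = TfidfVectorizer(stop_words="english")
    matrix = vectorizer.fit_transform(paragraphs)   # one row per paragraph
    query_vec = vectorizer.transform([query])       # same vocabulary as the index
    scores = cosine_similarity(query_vec, matrix).ravel()
    ranked = scores.argsort()[::-1][:top_k]         # highest score first
    # Skip paragraphs that share no terms with the query.
    return [(paragraphs[i], float(scores[i])) for i in ranked if scores[i] > 0]
```

With a hypothetical folder named `pdfs`, usage might look like `pairs = load_paragraphs("pdfs")` followed by `search([para for _, para in pairs], "your question")`. Extracted PDF text often has no blank lines between paragraphs, so the splitting rule here will probably need adjusting for real documents.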

🛠️ Requirements
Install the required libraries before running:

```bash
pip install pdfplumber scikit-learn
```

https://github.com/abdelmaleknenni/SmartPDFSearch