A Comprehensive Study on Text Detection and Extraction from Images and PDF Documents

Mayank Deshmukh; Saloni Rabde; Priyanka Makode; Sourabh Jasuja; Bhavesh Khasdev

A Comprehensive Study on Text Detection and Extraction from Images and PDF Documents

Authors: Mayank Deshmukh, Saloni Rabde, Priyanka Makode, Sourabh Jasuja, Bhavesh Khasdev

Country: India

Full-text Research PDF File: View | Download

Abstract: The growing need for digitization and intelligent document processing has led to significant advancements in text detection and extraction technologies. This paper reviews methodologies and tools employed for extracting textual information from images and Portable Document Format (PDF) files. Both traditional Optical Character Recognition (OCR) techniques and modern deep learning-based approaches are discussed. Five major research contributions in this area are analyzed in detail. The paper further explores challenges in handling complex document layouts, multilingual text, and low-quality images, and highlights research gaps and future directions that emphasize the potential of artificial intelligence and multimodal learning to enhance text extraction accuracy and efficiency.

Keywords: OCR, Text Extraction, Deep Learning, Layout LM, Scene Text Detection, Document Analysis.

Paper Id: 232796

Published On: 2025-11-07

Published In: Volume 13, Issue 6, November-December 2025

All research papers published in this journal/on this website are openly accessible and licensed under Creative Commons Attribution-ShareAlike 4.0 International License; accordingly, any user can read, download, copy, distribute, print, search, or link to the full texts of the authors/researchers submitted and published articles, crawl them for indexing, pass them as data to any software, or use them for any other lawful purpose. The journal is fulfilling the DOAJ's definition of open access.

About IJIRMPS Indexing & Archiving Publication Ethics Peer Review & Plagiarism	Website/Journal Policies Usage Policy Content Policies Privacy Policy	Contact Us +91-9687-828-838 editor@ijirmps.org

International Journal of Innovative Research in Engineering & Multidisciplinary Physical Sciences
E-ISSN: 2349-7300 • Impact Factor - 9.907

A Widely Indexed Open Access Peer Reviewed Online Scholarly International Journal

A Comprehensive Study on Text Detection and Extraction from Images and PDF Documents

Share this

International Journal of Innovative Research in Engineering & Multidisciplinary Physical Sciences E-ISSN: 2349-7300 • Impact Factor - 9.907

A Widely Indexed Open Access Peer Reviewed Online Scholarly International Journal

A Comprehensive Study on Text Detection and Extraction from Images and PDF Documents

Share this

International Journal of Innovative Research in Engineering & Multidisciplinary Physical Sciences
E-ISSN: 2349-7300 • Impact Factor - 9.907