Breaking language barriers with image detection and natural language processing model for English to Spanish translation

Guardado en:

Detalles Bibliográficos
Publicado en:	SN Applied Sciences vol. 7, no. 7 (Jul 2025), p. 682
Autor principal:	Salman, Bakhita
Otros Autores:	Lopez, Andres, Delapena, Nathanielle
Publicado:	Springer Nature B.V.
Materias:	Software Algorithms Optical character recognition Language Image detection Image processing Computer vision Machine translation Training Localization Machine learning Language translation Deep learning Coding Marking and tracking techniques Accuracy Datasets Translation Artificial intelligence Sign language Taxonomy Spanish language Natural language processing Multilingualism Linguistics Literature reviews Real time English language Edge detection Economic
Acceso en línea:	Citation/Abstract Full Text Full Text - PDF
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Descripción
Resumen:	This research proposes an advanced approach that integrates multiple image detection techniques and natural language processing (NLP) methodologies for English-to-Spanish language translation. The developed software accepts an image as input, which undergoes preprocessing using adaptive thresholding, morphological transformations, and edge detection algorithms such as Canny and Sobel operators to enhance text clarity. Text detection and localization are achieved using the EfficientDet and EAST (Efficient and Accurate Scene Text) detector frameworks, followed by Optical Character Recognition (OCR) using PyTesseract, a wrapper for Google’s Tesseract OCR. The detected text is passed to an NLP system for translation, which employs a sequence-to-sequence transformer model implemented with Keras, TensorFlow, and NumPy. Additional techniques, such as Byte Pair Encoding (BPE) for text tokenization and positional encoding for transformer-based attention, improve translation efficiency. An English-Spanish dictionary from Anki and a large parallel corpus dataset were used for training. The NLP pipeline leverages semantic analysis, part-of-speech tagging, and dependency parsing to preserve grammatical structure and context. Fine-tuning the transformer model parameters, including learning rate scheduling and gradient clipping, further optimized system performance. The research demonstrates a 93.7% translation accuracy, achieved by combining state-of-the-art image processing algorithms, advanced transformer architectures, and a robust training dataset. This hybrid approach significantly improves the accuracy of English-to-Spanish translations, validating the effectiveness of integrating computer vision and NLP technologies.Article highlights<list list-type="bullet"><list-item></list-item>Enhanced translation accuracy—AI-powered translation models improve English-to-Spanish accuracy by preserving grammar, sentence structure, and contextual meaning. The integration of deep learning and NLP ensures precise translations across various text types.<list-item>Optimized image-to-text extraction—advanced OCR techniques, including adaptive thresholding, morphological transformations, and deep learning-based text detection, enhance the accuracy of extracting text from images, even in complex backgrounds or poor lighting conditions.</list-item><list-item>Scalable AI solutions for real-world applications—the combination of computer vision and NLP enables practical applications in multilingual communication, document translation, accessibility for visually impaired users, and real-time text recognition for travel, business, and education. The AI-driven approach ensures scalability across diverse environments and languages.</list-item>
ISSN:	2523-3963 2523-3971
DOI:	10.1007/s42452-025-06891-9
Fuente:	Science Database