word images
Recently Published Documents


TOTAL DOCUMENTS

59
(FIVE YEARS 20)

H-INDEX

12
(FIVE YEARS 2)

2021 ◽  
Vol 311 (2) ◽  
pp. 475-504
Author(s):  
Jakob Schneider ◽  
Andreas Thom

2021 ◽  
Vol 3 (8) ◽  
Author(s):  
Fetulhak Abdurahman ◽  
Eyob Sisay ◽  
Kinde Anlay Fante

AbstractAmharic ("Image missing") is the official language of the Federal Government of Ethiopia, with more than 27 million speakers. It uses an Ethiopic script, which has 238 core and 27 labialized characters. It is a low-resourced language, and a few attempts have been made so far for its handwritten text recognition. However, Amharic handwritten text recognition is challenging due to the very high similarity between characters. This paper presents a convolutional recurrent neural networks based offline handwritten Amharic word recognition system. The proposed framework comprises convolutional neural networks (CNNs) for feature extraction from input word images, recurrent neural network (RNNs) for sequence encoding, and connectionist temporal classification as a loss function. We designed a custom CNN model and compared its performance with three different state-of-the-art CNN models, including DenseNet-121, ResNet-50 and VGG-19 after modifying their architectures to fit our problem domain, for robust feature extraction from handwritten Amharic word images. We have conducted detailed experiments with different CNN and RNN architectures, input word image sizes, and applied data augmentation techniques to enhance performance of the proposed models. We have prepared a handwritten Amharic word dataset, HARD-I, which is available publicly for researchers. From the experiments on various recognition models using our dataset, a WER of 5.24 % and CER of 1.15 % were achieved using our best-performing recognition model. The proposed models achieve a competitive performance compared to existing models for offline handwritten Amharic word recognition.


Sensors ◽  
2021 ◽  
Vol 21 (14) ◽  
pp. 4648
Author(s):  
Subhranil Kundu ◽  
Samir Malakar ◽  
Zong Woo Geem ◽  
Yoon Young Moon ◽  
Pawan Kumar Singh ◽  
...  

Handwritten keyword spotting (KWS) is of great interest to the document image research community. In this work, we propose a learning-free keyword spotting method following query by example (QBE) setting for handwritten documents. It consists of four key processes: pre-processing, vertical zone division, feature extraction, and feature matching. The pre-processing step deals with the noise found in the word images, and the skewness of the handwritings caused by the varied writing styles of the individuals. Next, the vertical zone division splits the word image into several zones. The number of vertical zones is guided by the number of letters in the query word image. To obtain this information (i.e., number of letters in a query word image) during experimentation, we use the text encoding of the query word image. The user provides the information to the system. The feature extraction process involves the use of the Hough transform. The last step is feature matching, which first compares the features extracted from the word images and then generates a similarity score. The performance of this algorithm has been tested on three publicly available datasets: IAM, QUWI, and ICDAR KWS 2015. It is noticed that the proposed method outperforms state-of-the-art learning-free KWS methods considered here for comparison while evaluated on the present datasets. We also evaluate the performance of the present KWS model using state-of-the-art deep features and it is found that the features used in the present work perform better than the deep features extracted using InceptionV3, VGG19, and DenseNet121 models.


2021 ◽  
Vol 7 ◽  
pp. e596
Author(s):  
Rodney Pino ◽  
Renier Mendoza ◽  
Rachelle Sambayan

Baybayin is a pre-Hispanic Philippine writing system used in Luzon island. With the effort in reintroducing the script, in 2018, the Committee on Basic Education and Culture of the Philippine Congress approved House Bill 1022 or the ”National Writing System Act,” which declares the Baybayin script as the Philippines’ national writing system. Since then, Baybayin OCR has become a field of research interest. Numerous works have proposed different techniques in recognizing Baybayin scripts. However, all those studies anchored on the classification and recognition at the character level. In this work, we propose an algorithm that provides the Latin transliteration of a Baybayin word in an image. The proposed system relies on a Baybayin character classifier generated using the Support Vector Machine (SVM). The method involves isolation of each Baybayin character, then classifying each character according to its equivalent syllable in Latin script, and finally concatenate each result to form the transliterated word. The system was tested using a novel dataset of Baybayin word images and achieved a competitive 97.9% recognition accuracy. Based on our review of the literature, this is the first work that recognizes Baybayin scripts at the word level. The proposed system can be used in automated transliterations of Baybayin texts transcribed in old books, tattoos, signage, graphic designs, and documents, among others.


Author(s):  
Sukalpa Chanda ◽  
Daniel Haitink ◽  
Prashant Kumar Prasad ◽  
Jochem Baas ◽  
Umapada Pal ◽  
...  
Keyword(s):  

Author(s):  
Umesh D. Dixit ◽  
Rahul Hiraskar ◽  
Raghavendra Purohit ◽  
Sagar Shivanagutti
Keyword(s):  

2020 ◽  
pp. 41-59
Author(s):  
Nina Danylyuk

The article investigates the fi gurative language (words) of the biblical origin such as God, Jesus Christ, God’s Mother and other that obtained new national cultural meanings and became ethnologemes in the texts of Ukrainian folklore. The research is conducted on the basis of a great number of reference texts, written down mainly in the 19th – beginning of the 20th centuries. It was found out that the texts of oral lore are linguo-aesthetic signs of ethnic culture refl ect a mode of thinking of a nation at diff erent historical periods. It was pointed out that modern authors understand the meaning of such terms as a “word-image” and “language image” in diff erent ways and add their individual interpretations. Key words-images have been analysed in the context of the linguistic conceptual map of the Ukrainians in whom pagan and pre-Christian beliefs of our people are refl ected. It was discovered that Biblical words-images in the Ukrainian folklore were reconsidered and, as a result, it led to the changes in their forms and meanings. The analysis of the folk songs texts makes it possible to conclude that the word-images of Jesus Christ and God’s Mother are developed to the level of language images obtaining specifi c senses that are typical to a real man, a master/landlord, and a woman, a mother and a hostess/ landlady.


Sign in / Sign up

Export Citation Format

Share Document