ELCVIA Electronic Letters on Computer Vision and Image Analysis
Latest Publications


TOTAL DOCUMENTS

283
(FIVE YEARS 36)

H-INDEX

15
(FIVE YEARS 1)

Published By Universitat Autonoma De Barcelona

1577-5097

Author(s):  
Priyadharsini Ravisankar

Underwater acoustic images are captured by sonar technology, which uses sound as its source. Noise in acoustic images may arise during acquisition. This noise may be multiplicative in nature and can seriously degrade the visual quality of the images. Image denoising techniques generally use linear or non-linear filters to remove noise. In this paper, a wavelet-based denoising method is used to reduce the noise in the images. The image is decomposed using the Stationary Wavelet Transform (SWT) into low- and high-frequency components. Shrinkage functions such as VisuShrink and SureShrink are used to select the threshold for removing undesirable signals in the low-frequency component. The high-frequency components, such as edges and corners, are retained. The inverse SWT then reconstructs the denoised image by combining the modified low-frequency components with the high-frequency components. The Peak Signal-to-Noise Ratio (PSNR) performance measure is obtained for various wavelets, such as Haar, Daubechies, and Coiflet, and for different thresholding methods.
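The thresholding and evaluation steps named in this abstract can be sketched roughly as follows. This is a minimal illustration, not the author's implementation: it assumes the wavelet decomposition has already been done elsewhere, applies the standard VisuShrink universal threshold to whichever coefficient array is passed in, and uses a median-based noise estimate that is a common convention rather than something stated in the abstract. The function names are hypothetical.

```python
import numpy as np

def universal_threshold(coeffs: np.ndarray) -> float:
    """VisuShrink universal threshold: sigma * sqrt(2 * ln(N))."""
    # Assumption: noise level is estimated from the median absolute value
    # of the coefficients (a common robust estimator for Gaussian noise).
    sigma = np.median(np.abs(coeffs)) / 0.6745
    return float(sigma * np.sqrt(2.0 * np.log(coeffs.size)))

def soft_threshold(coeffs: np.ndarray, t: float) -> np.ndarray:
    """Shrink coefficients toward zero by t; values with |c| <= t become 0."""
    return np.sign(coeffs) * np.maximum(np.abs(coeffs) - t, 0.0)

def psnr(reference: np.ndarray, estimate: np.ndarray, peak: float = 255.0) -> float:
    """Peak Signal-to-Noise Ratio in dB between two images of equal shape."""
    diff = reference.astype(np.float64) - estimate.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))
```

A typical use would threshold one sub-band with `soft_threshold(band, universal_threshold(band))`, reconstruct the image, and then compare it with a clean reference using `psnr`.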


Author(s):  
Tarek Bentahar

In this paper, an accuracy improvement of the quality-guided phase unwrapping algorithm is proposed. Our proposal is based on a modified phase derivative variance (PDV) that provides more detail on local variations, especially for important patterns such as fringes and edges; distorted regions may therefore be re-unwrapped according to this new, more reliable PDV. The improvement benefits not only accuracy but also running time: the results show that the running time with our proposal is less than that of a skillful optimization-based algorithm. To demonstrate effectiveness, experiments are carried out on simulated and real data, and the comparison is made under several relevant criteria.
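For orientation, a conventional phase derivative variance quality map can be sketched as below. This is the standard measure (up to a constant scale factor), not the modified version this abstract proposes, whose details are not given here. The window size `k`, the edge-replicated padding, and all function names are assumptions made for illustration. Lower values indicate more reliable pixels.

```python
import numpy as np

def wrap(phase: np.ndarray) -> np.ndarray:
    """Wrap phase values into the interval [-pi, pi)."""
    return (phase + np.pi) % (2.0 * np.pi) - np.pi

def box_mean(a: np.ndarray, k: int) -> np.ndarray:
    """Mean over a k x k window (k odd), using edge-replicated padding."""
    pad = k // 2
    p = np.pad(a, pad, mode="edge")
    out = np.zeros(a.shape, dtype=np.float64)
    for dy in range(k):
        for dx in range(k):
            out += p[dy:dy + a.shape[0], dx:dx + a.shape[1]]
    return out / (k * k)

def phase_derivative_variance(wrapped: np.ndarray, k: int = 3) -> np.ndarray:
    """Local spread of wrapped phase derivatives; low value = high quality."""
    # Wrapped finite differences; the last column/row difference is set to 0
    # so the derivative maps keep the input shape.
    dx = wrap(np.diff(wrapped, axis=1, append=wrapped[:, -1:]))
    dy = wrap(np.diff(wrapped, axis=0, append=wrapped[-1:, :]))
    var_x = box_mean(dx ** 2, k) - box_mean(dx, k) ** 2
    var_y = box_mean(dy ** 2, k) - box_mean(dy, k) ** 2
    # Clip tiny negative values caused by floating-point rounding.
    return np.sqrt(np.maximum(var_x, 0.0)) + np.sqrt(np.maximum(var_y, 0.0))
```

A quality-guided unwrapper would visit pixels in increasing order of this map, so smooth fringe regions are unwrapped before noisy ones.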


Author(s):  
Shivangi Anthwal

Facial expressions are an integral part of non-verbal paralinguistic communication, as they provide cues significant in perceiving one’s emotional state. Assessment of emotions through expressions is an active research domain in computer vision due to its potential applications in multi-faceted domains. In this work, an approach is presented in which facial expressions are modelled and analyzed with divergence and curl templates derived from dense optical flow; these templates embody the ideal motion pattern of facial features as an expression unfolds on the face. Two classification schemes, based on a multi-class support vector machine and k-nearest neighbour, are employed for evaluation. Promising results from a comparative analysis of the proposed approach against state-of-the-art techniques on the Extended Cohn Kanade database, and against human cognition and the pre-trained Microsoft face application programming interface on the Karolinska Directed Emotional Faces database, validate the efficiency of the approach.
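As a rough illustration of the two quantities named in this abstract, the divergence and curl of a dense flow field can be computed with finite differences. The sketch assumes the flow is already available as two arrays, `u` (horizontal) and `v` (vertical), indexed as `[row, column]`; how the flow is estimated and how the templates are built are not specified in the abstract and are not shown. The function name is hypothetical.

```python
import numpy as np

def divergence_and_curl(u: np.ndarray, v: np.ndarray):
    """Return (divergence, curl) of a 2-D flow field with components u, v."""
    # np.gradient returns derivatives along axis 0 (rows, y) then axis 1 (columns, x).
    du_dy, du_dx = np.gradient(u)
    dv_dy, dv_dx = np.gradient(v)
    divergence = du_dx + dv_dy  # local expansion or contraction
    curl = dv_dx - du_dy        # local rotation
    return divergence, curl
```

A purely expanding field such as `u = x, v = y` has positive divergence and zero curl, while a rotating field such as `u = -y, v = x` has zero divergence and positive curl.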


2021 ◽  
Vol 20 (1) ◽  
pp. 99-116
Author(s):  
Chakradhar Guntuboina ◽  
Aditya Porwal ◽  
Preet Jain ◽  
Hansa Shingrakhia

This paper proposes a computationally inexpensive method for automatic key-event extraction and subsequent summarization of sports videos using scoreboard detection. A database of 1300 images was used to train a supervised-learning-based object detection algorithm, YOLO (You Only Look Once). For each frame of the video, once the scoreboard was detected using YOLO, it was cropped out of the image. Image processing techniques were then applied to the cropped scoreboard to reduce noise and false positives. Finally, the processed image was passed through an OCR (Optical Character Recognizer) to read the score. A rule-based algorithm was run on the OCR output to generate the timestamps of key events based on the game. The proposed method is best suited for people who want to analyse games and need precise timestamps of important events. The performance of the proposed design was tested on videos of the Bundesliga, English Premier League, ICC WC 2019, IPL 2019, and Pro Kabaddi League. An average F1 score of 0.979 was achieved during the simulations. The algorithm is trained on five different classes of three separate games (soccer, cricket, kabaddi). The design is implemented using Python 3.7.
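The rule-based step that turns per-frame OCR readings into key-event timestamps might look something like the sketch below. The abstract does not give the actual rules, so this is only one plausible scheme: a score change counts as an event once the new value has been read on `min_stable` consecutive valid frames, which filters out isolated OCR misreads. The function name, the `min_stable` parameter, and the use of `None` for frames where OCR failed are all assumptions.

```python
from typing import List, Optional, Tuple

def key_event_timestamps(
    scores: List[Optional[int]], fps: float, min_stable: int = 3
) -> List[Tuple[float, int]]:
    """Return (time_in_seconds, new_score) for each confirmed score change."""
    events: List[Tuple[float, int]] = []
    current: Optional[int] = None    # last confirmed score
    candidate: Optional[int] = None  # new value waiting for confirmation
    run = 0                          # valid frames supporting the candidate
    start = 0                        # frame index where the candidate first appeared
    for i, s in enumerate(scores):
        if s is None:
            continue  # OCR produced nothing for this frame; ignore it
        if s == current:
            candidate, run = None, 0  # reading agrees with the confirmed score
            continue
        if s == candidate:
            run += 1
        else:
            candidate, run, start = s, 1, i
        if run >= min_stable:
            if current is not None:  # the first confirmed score is not an event
                events.append((start / fps, s))
            current, candidate, run = s, None, 0
    return events
```

With `fps=1` and `min_stable=3`, the sequence `[0, 0, 0, 0, 1, 0, 0, 1, 1, 1, 1, None, 2, 2, 2]` yields events at 7.0 s (score 1) and 12.0 s (score 2); the single misread at frame 4 is discarded.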


Author(s):  
FAHMI AKMAL DZULKIFLI

Contrast enhancement plays an important part in image processing. In histology, applying a contrast enhancement technique is necessary because it helps pathologists diagnose sample slides by increasing the visibility of the morphological features of cells in an image. Various techniques have been proposed to enhance the contrast of microscopic images. This paper therefore studies the effectiveness of contrast enhancement techniques in enhancing Ki67 images of astrocytoma. Three contrast enhancement techniques, namely contrast stretching, histogram equalization, and CLAHE, were applied to the sample images. The performance of each technique was compared by computing seven quantitative measures. The CLAHE technique was preferred for enhancing the contrast of the astrocytoma images. It produces good results, especially in contrast enhancement, edge conservation and enhancement, brightness preservation, and minimal distortion of the enhanced images.
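Two of the three techniques compared in this abstract are simple enough to sketch directly. The code below assumes 8-bit grayscale input (`uint8` arrays) and shows textbook versions of contrast stretching and global histogram equalization; CLAHE, which equalizes per tile with a clip limit, is not shown. These are generic formulations and not necessarily the exact variants used in the paper.

```python
import numpy as np

def contrast_stretch(img: np.ndarray) -> np.ndarray:
    """Linearly map the image's [min, max] range onto [0, 255]."""
    lo, hi = int(img.min()), int(img.max())
    if hi == lo:
        return img.copy()  # flat image: nothing to stretch
    out = (img.astype(np.float64) - lo) * 255.0 / (hi - lo)
    return np.round(out).astype(np.uint8)

def equalize_histogram(img: np.ndarray) -> np.ndarray:
    """Global histogram equalization for a uint8 grayscale image."""
    hist = np.bincount(img.ravel(), minlength=256)
    cdf = np.cumsum(hist)
    cdf_min = cdf[cdf > 0][0]  # CDF value at the darkest occupied level
    if img.size == cdf_min:
        return img.copy()      # only one gray level present
    lut = np.round((cdf - cdf_min) / (img.size - cdf_min) * 255.0)
    lut = np.clip(lut, 0, 255).astype(np.uint8)
    return lut[img]            # apply the lookup table per pixel
```

Contrast stretching preserves the shape of the histogram and only rescales it, whereas equalization redistributes gray levels so that the cumulative distribution becomes approximately linear.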


2021 ◽  
Vol 20 (1) ◽  
pp. 117-136
Author(s):  
Cheng Yang ◽  
Xiang Yu ◽  
Arun Kumar ◽  
G.G. Md. Nawaz Ali ◽  
Peter Han Joo Chong ◽  
...  

This paper introduces a method that uses deep convolutional neural networks (CNNs) to automatically replace advertisement (AD) photos in social (or self-media) videos, and provides a suitable evaluation method for comparing different CNNs. An AD photo can replace a picture inside a video. However, if a human being occludes the replaced picture in the original video, the newly pasted AD photo will cover the occluding part of the human. A deep learning algorithm is used to segment the human being from the video. The segmented human pixels are then pasted back over the occluded area, so that the AD photo replacement looks natural in the video. This process requires the predicted occlusion edge to be close to the ground-truth occlusion edge, so that the AD photo can be occluded naturally. Therefore, this research introduces a curve fitting method to measure the error of the predicted occlusion edge. Using this method, three CNN methods are applied and compared for AD replacement: mask of regions convolutional neural network (Mask RCNN), a recurrent network for video object segmentation (ROVS), and DeeplabV3. The experimental results show the comparative segmentation accuracy of the different models, with DeeplabV3 showing the best performance.
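The paste-then-restore step described in this abstract can be illustrated with a few lines of array code. The sketch assumes the AD photo is pasted into an axis-aligned rectangle and that a boolean person mask for the frame is already available from some segmentation model. The function name and parameters are hypothetical, and perspective warping of the AD photo is ignored.

```python
import numpy as np

def composite_ad(
    frame: np.ndarray, ad: np.ndarray, top_left: tuple, person_mask: np.ndarray
) -> np.ndarray:
    """Paste `ad` into `frame`, then restore the pixels covered by the person.

    frame:       H x W x 3 image
    ad:          h x w x 3 image to paste, fitting inside the frame
    top_left:    (row, column) of the paste position
    person_mask: H x W boolean array, True where the person is
    """
    out = frame.copy()
    y, x = top_left
    h, w = ad.shape[:2]
    out[y:y + h, x:x + w] = ad            # the AD photo covers everything first
    out[person_mask] = frame[person_mask]  # person pixels go back on top
    return out
```

Because the person pixels are copied back last, any error in the mask boundary shows up directly along the occlusion edge, which is why the abstract focuses on measuring that edge.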


Author(s):  
Sankirti Sandeep Shiravale ◽  
Jayadevan R ◽  
Sanjeev S Sannakki

Author(s):  
Aditya Kakde ◽  
Durgansh Sharma ◽  
Bhavana Kaushik ◽  
Nitin Arora
