proximity graph
Recently Published Documents


TOTAL DOCUMENTS

42
(FIVE YEARS 14)

H-INDEX

6
(FIVE YEARS 1)

Author(s):  
Hayato Nakama ◽  
Daichi Amagata ◽  
Takahiro Hara

Author(s):  
Larissa C. Shimomura ◽  
Daniel S. Kaster

Similarity searching is a widely used approach to retrieve complex data (images, videos, time series, etc.). Similarity searches aim at retrieving similar data according to the intrinsic characteristics of the data. Recently, graph-based methods have emerged as a very efficient alternative for similarity retrieval, with reports indicating they have outperformed methods of other categories in several situations. This work presents two main contributions to graph-based methods for similarity searches. The first contribution is a survey on the main graph types currently employed for similarity searches and an experimental evaluation of the most representative graphs in a common platform for exact and approximate search algorithms. The second contribution is a new graph-based method called HGraph, which is a connected-partition approach to build a proximity graph and answer similarity searches. Both of our contributions and results were published and received awards in international conferences.


Author(s):  
Shulong Tan ◽  
Zhaozhuo Xu ◽  
Weijie Zhao ◽  
Hongliang Fei ◽  
Zhixin Zhou ◽  
...  

2021 ◽  
pp. 107305
Author(s):  
Xiaoliang Xu ◽  
Mengzhao Wang ◽  
Yuxiang Wang ◽  
Dingcheng Ma

2021 ◽  
Author(s):  
Melanie Kirsche ◽  
Gautam Prabhu ◽  
Rachel Sherman ◽  
Bohan Ni ◽  
Sergey Aganezov ◽  
...  

The increasing availability of long-reads is revolutionizing studies of structural variants (SVs). However, because SVs vary across individuals and are discovered through imprecise read technologies and methods, they can be difficult to compare. Addressing this, we present Jasmine (https://github.com/mkirsche/Jasmine), a fast and accurate method for SV refinement, comparison, and population analysis. Using an SV proximity graph, Jasmine outperforms five widely-used comparison methods, including reducing the rate of Mendelian discordance in trio datasets by more than five-fold, and reveals a set of high confidence de novo SVs confirmed by multiple long-read technologies. We also present a harmonized callset of 205,192 SVs from 31 samples of diverse ancestry sequenced with long reads. We genotype these SVs in 444 short read samples from the 1000 Genomes Project with both DNA and RNA sequencing data and assess their widespread impact on gene expression, including within several medically relevant genes.


2020 ◽  
Vol 34 (01) ◽  
pp. 139-146
Author(s):  
Jie Liu ◽  
Xiao Yan ◽  
Xinyan Dai ◽  
Zhirong Li ◽  
James Cheng ◽  
...  

The inner-product navigable small world graph (ip-NSW) represents the state-of-the-art method for approximate maximum inner product search (MIPS) and it can achieve an order of magnitude speedup over the fastest baseline. However, to date it is still unclear where its exceptional performance comes from. In this paper, we show that there is a strong norm bias in the MIPS problem, which means that the large norm items are very likely to become the result of MIPS. Then we explain the good performance of ip-NSW as matching the norm bias of the MIPS problem — large norm items have big in-degrees in the ip-NSW proximity graph and a walk on the graph spends the majority of computation on these items, thus effectively avoids unnecessary computation on small norm items. Furthermore, we propose the ip-NSW+ algorithm, which improves ip-NSW by introducing an additional angular proximity graph. Search is first conducted on the angular graph to find the angular neighbors of a query and then the MIPS neighbors of these angular neighbors are used to initialize the candidate pool for search on the inner-product proximity graph. Experiment results show that ip-NSW+ consistently and significantly outperforms ip-NSW and provides more robust performance under different data distributions.


Author(s):  
Rafael Seidi Oyamada ◽  
Larissa C. Shimomura ◽  
Sylvio Barbon Junior ◽  
Daniel S. Kaster

Sign in / Sign up

Export Citation Format

Share Document