SCALABLE ALGORITHMS FOR BICHROMATIC LINE SEGMENT INTERSECTION PROBLEMS ON COARSE GRAINED MULTICOMPUTERS

1996 ◽  
Vol 06 (04) ◽  
pp. 487-506 ◽  
Author(s):  
ANDREAS FABRI ◽  
OLIVIER DEVILLERS

We present output-sensitive scalable parallel algorithms for bichromatic line segment intersection problems for the coarse grained multicomputer model. Under the assumption that n≥p2, where n is the number of line segments and p the number of processors, we obtain an intersection counting algorithm with a time complexity of [Formula: see text], where Ts(m, p) is the time used to sort m items on a p processor machine. The first term captures the time spent in sequential computation performed locally by each processor. The second term captures the interprocessor communication time. An additional [Formula: see text] time in sequential computation is spent on the reporting of the k intersections. As the sequential time complexity is O(n log n) for counting and an additional time O(k) for reporting, we obtain a speedup of [Formula: see text] in the sequential part of the algorithm. The speedup in the communication part obviously depends on the underlying architecture. For example for a hypercube it ranges between [Formula: see text] and [Formula: see text] depending on the ratio of n and p. As the reporting does not involve more interprocessor communication than the counting, the algorithm achieves a full speedup of p for k≥ O( max (n log n log p, n log 3 p)) even on a hypercube.

2017 ◽  
Vol 83 (4) ◽  
Author(s):  
J. Guadagni ◽  
A. J. Cerfon

We present a fast and spectrally accurate numerical scheme for the evaluation of the gyroaveraged electrostatic potential in non-periodic gyrokinetic-Poisson simulations. Our method relies on a reformulation of the gyrokinetic-Poisson system in which the gyroaverage in Poisson’s equation is computed for the compactly supported charge density instead of the non-periodic, non-compactly supported potential itself. We calculate this gyroaverage with a combination of two Fourier transforms and a Hankel transform, which has the near optimal run-time complexity$O(N_{\unicode[STIX]{x1D70C}}(P+\hat{P})\log (P+\hat{P}))$, where$P$is the number of spatial grid points,$\hat{P}$the number of grid points in Fourier space and$N_{\unicode[STIX]{x1D70C}}$the number of grid points in velocity space. We present numerical examples illustrating the performance of our code and demonstrating geometric convergence of the error.


1997 ◽  
pp. 19-43
Author(s):  
Mark de Berg ◽  
Marc van Kreveld ◽  
Mark Overmars ◽  
Otfried Schwarzkopf

2003 ◽  
Vol 14 (01) ◽  
pp. 79-98
Author(s):  
CHIN-HSIUNG WU ◽  
SHI-JINN HORNG

The computational model on which the algorithms are developed is the array with reconfigurable optical buses (AROB). It integrates the advantages of both optical transmission and electronic computation. The main contributions of this paper are in designing several optimal and/or optimal speed-up template matching algorithms with varying degrees of parallelism on the AROB model. For an N × N digitized image and an M × M template, when the domains of the image and the template are O( log N)-bit integers, we first design several basic operations for window broadcasting and rotation. Then based on these basic operations, three efficient and scalable algorithms for template matching are derived using various numbers of processors on a two-dimensional (2-D) or 3-D AROB. For 1 ≤ r ≤ N, 1 ≤ p ≤ M ≤ q ≤ N, one runs in [Formula: see text] time using r × r processors, another runs in [Formula: see text], (resp. [Formula: see text]) time using pN × pN/ log M (resp. pN × pN × log N) processors, and the other runs in [Formula: see text] (resp. [Formula: see text]) time using pq × pq/ log M (or pq × pqN × log N) processors, respectively. The latter two algorithms can be tuned to run in O(1) time on a 2-D AROB. To the best of our knowledge, there are no algorithms which can reach this time complexity for this problem on a 2-D array architecture.


Computing ◽  
2005 ◽  
Vol 75 (4) ◽  
pp. 337-357 ◽  
Author(s):  
Y.-K. Zhu ◽  
J.-H. Yong ◽  
G.-Q. Zheng

1999 ◽  
Vol 09 (04) ◽  
pp. 539-550 ◽  
Author(s):  
JEAN CARLE ◽  
JEAN-FREDERIC MYOUPO ◽  
DAVID SEME

This paper presents two simple all-to-all broadcasting algorithms on honeycomb mesh. Consider a network with n processors, one has personalized routing strategy at each node and it requires a 3n communication time complexity. This communication time can be reduced to n because the computation time is always assumed to be much lower than the communication time. The other is based on a Hamiltonian path and has a 2n communication time complexity. We show how they can be used to get parallel solutions to a class of problems on honeycomb networks, among others Prefix Sums, Maximal Vectors, Maximal Sum Subsegment, Parenthesis Matching, Decoding Binary Tree, and Sorting. In our knowledge, these all-to-all broadcast algorithms are the only ones so far exhibited on a honeycomb.


Sign in / Sign up

Export Citation Format

Share Document