scholarly journals Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code

2015 ◽  
Vol 50 (9) ◽  
pp. 205-217 ◽  
Author(s):  
Michel Steuwer ◽  
Christian Fensch ◽  
Sam Lindley ◽  
Christophe Dubach
2020 ◽  
Author(s):  
James McDonagh ◽  
William Swope ◽  
Richard L. Anderson ◽  
Michael Johnston ◽  
David J. Bray

Digitization offers significant opportunities for the formulated product industry to transform the way it works and develop new methods of business. R&D is one area of operation that is challenging to take advantage of these technologies due to its high level of domain specialisation and creativity but the benefits could be significant. Recent developments of base level technologies such as artificial intelligence (AI)/machine learning (ML), robotics and high performance computing (HPC), to name a few, present disruptive and transformative technologies which could offer new insights, discovery methods and enhanced chemical control when combined in a digital ecosystem of connectivity, distributive services and decentralisation. At the fundamental level, research in these technologies has shown that new physical and chemical insights can be gained, which in turn can augment experimental R&D approaches through physics-based chemical simulation, data driven models and hybrid approaches. In all of these cases, high quality data is required to build and validate models in addition to the skills and expertise to exploit such methods. In this article we give an overview of some of the digital technology demonstrators we have developed for formulated product R&D. We discuss the challenges in building and deploying these demonstrators.<br>


2020 ◽  
Vol 12 (2) ◽  
pp. 19-50 ◽  
Author(s):  
Muhammad Siddique ◽  
Shandana Shoaib ◽  
Zahoor Jan

A key aspect of work processes in service sector firms is the interconnection between tasks and performance. Relational coordination can play an important role in addressing the issues of coordinating organizational activities due to high level of interdependence complexity in service sector firms. Research has primarily supported the aspect that well devised high performance work systems (HPWS) can intensify organizational performance. There is a growing debate, however, with regard to understanding the “mechanism” linking HPWS and performance outcomes. Using relational coordination theory, this study examines a model that examine the effects of subsets of HPWS, such as motivation, skills and opportunity enhancing HR practices on relational coordination among employees working in reciprocal interdependent job settings. Data were gathered from multiple sources including managers and employees at individual, functional and unit levels to know their understanding in relation to HPWS and relational coordination (RC) in 218 bank branches in Pakistan. Data analysis via structural equation modelling, results suggest that HPWS predicted RC among officers at the unit level. The findings of the study have contributions to both, theory and practice.


2021 ◽  
pp. 1-7
Author(s):  
Haniel Fernandes

<b><i>Background:</i></b> Soccer is an extremely competitive sport, where the most match important moments can be defined in detail. Use of ergogenic supplements can be crucial to improve the performance of a high-performance athlete. Therefore, knowing which ergogenic supplements are important for soccer players can be an interesting strategy to maintain high level in this sport until final and decisive moments of the match. In addition, other supplements, such as dietary supplements, have been studied and increasingly referenced in the scientific literature. But, what if ergogenic supplements were combined with dietary supplements? This review brings some recommendations to improve performance of soccer athletes on the field through dietary and/or ergogenic supplements that can be used simultaneously. <b><i>Summary:</i></b> Soccer is a competitive sport, where the match important moments can be defined in detail. Thus, use of ergogenic supplements covered in this review can improve performance of elite soccer players maintaining high level in the match until final moments, such as creatine 3–5 g day<sup>−1</sup>, caffeine 3–6 mg kg<sup>−1</sup> BW around 60 min before the match, sodium bicarbonate 0.1–0.4 g kg<sup>−1</sup> BW starting from 30 to 180 min before the match, β-alanine 3.2 and 6.4 g day<sup>−1</sup> provided in the sustained-release tablets divided into 4 times a day, and nitrate-rich beetroot juice 60 g in 200 mL of water (6 mmol of NO3<sup>−</sup> L) around 120 min before match or training, including a combination possible with taurine 50 mg kg<sup>−1</sup> BW day<sup>−1</sup>, citrulline 1.2–3.4 g day<sup>−1</sup>, and arginine 1.2–6 g day<sup>−1</sup>. <b><i>Key Messages:</i></b> Soccer athletes can combine ergogenic and dietary supplements to improve their performance on the field. The ergogenic and dietary supplements used in a scientifically recommended dose did not demonstrate relevant side effects. The use of various evidence-based supplements can add up to further improvement in the performance of the elite soccer players.


Diversity ◽  
2021 ◽  
Vol 13 (2) ◽  
pp. 71
Author(s):  
Charalampos Dimitriadis ◽  
Ivoni Fournari-Konstantinidou ◽  
Laurent Sourbès ◽  
Drosos Koutsoubas ◽  
Stelios Katsanevakis

Understanding the interactions among invasive species, native species and marine protected areas (MPAs), and the long-term regime shifts in MPAs is receiving increased attention, since biological invasions can alter the structure and functioning of the protected ecosystems and challenge conservation efforts. Here we found evidence of marked modifications in the rocky reef associated biota in a Mediterranean MPA from 2009 to 2019 through visual census surveys, due to the presence of invasive species altering the structure of the ecosystem and triggering complex cascading effects on the long term. Low levels of the populations of native high-level predators were accompanied by the population increase and high performance of both native and invasive fish herbivores. Subsequently the overgrazing and habitat degradation resulted in cascading effects towards the diminishing of the native and invasive invertebrate grazers and omnivorous benthic species. Our study represents a good showcase of how invasive species can coexist or exclude native biota and at the same time regulate or out-compete other established invaders and native species.


Author(s):  
Umar Ibrahim Minhas ◽  
Roger Woods ◽  
Georgios Karakonstantis

AbstractWhilst FPGAs have been used in cloud ecosystems, it is still extremely challenging to achieve high compute density when mapping heterogeneous multi-tasks on shared resources at runtime. This work addresses this by treating the FPGA resource as a service and employing multi-task processing at the high level, design space exploration and static off-line partitioning in order to allow more efficient mapping of heterogeneous tasks onto the FPGA. In addition, a new, comprehensive runtime functional simulator is used to evaluate the effect of various spatial and temporal constraints on both the existing and new approaches when varying system design parameters. A comprehensive suite of real high performance computing tasks was implemented on a Nallatech 385 FPGA card and show that our approach can provide on average 2.9 × and 2.3 × higher system throughput for compute and mixed intensity tasks, while 0.2 × lower for memory intensive tasks due to external memory access latency and bandwidth limitations. The work has been extended by introducing a novel scheduling scheme to enhance temporal utilization of resources when using the proposed approach. Additional results for large queues of mixed intensity tasks (compute and memory) show that the proposed partitioning and scheduling approach can provide higher than 3 × system speedup over previous schemes.


Author(s):  
Breno A. de Melo Menezes ◽  
Nina Herrmann ◽  
Herbert Kuchen ◽  
Fernando Buarque de Lima Neto

AbstractParallel implementations of swarm intelligence algorithms such as the ant colony optimization (ACO) have been widely used to shorten the execution time when solving complex optimization problems. When aiming for a GPU environment, developing efficient parallel versions of such algorithms using CUDA can be a difficult and error-prone task even for experienced programmers. To overcome this issue, the parallel programming model of Algorithmic Skeletons simplifies parallel programs by abstracting from low-level features. This is realized by defining common programming patterns (e.g. map, fold and zip) that later on will be converted to efficient parallel code. In this paper, we show how algorithmic skeletons formulated in the domain specific language Musket can cope with the development of a parallel implementation of ACO and how that compares to a low-level implementation. Our experimental results show that Musket suits the development of ACO. Besides making it easier for the programmer to deal with the parallelization aspects, Musket generates high performance code with similar execution times when compared to low-level implementations.


Electronics ◽  
2021 ◽  
Vol 10 (5) ◽  
pp. 627
Author(s):  
David Marquez-Viloria ◽  
Luis Castano-Londono ◽  
Neil Guerrero-Gonzalez

A methodology for scalable and concurrent real-time implementation of highly recurrent algorithms is presented and experimentally validated using the AWS-FPGA. This paper presents a parallel implementation of a KNN algorithm focused on the m-QAM demodulators using high-level synthesis for fast prototyping, parameterization, and scalability of the design. The proposed design shows the successful implementation of the KNN algorithm for interchannel interference mitigation in a 3 × 16 Gbaud 16-QAM Nyquist WDM system. Additionally, we present a modified version of the KNN algorithm in which comparisons among data symbols are reduced by identifying the closest neighbor using the rule of the 8-connected clusters used for image processing. Real-time implementation of the modified KNN on a Xilinx Virtex UltraScale+ VU9P AWS-FPGA board was compared with the results obtained in previous work using the same data from the same experimental setup but offline DSP using Matlab. The results show that the difference is negligible below FEC limit. Additionally, the modified KNN shows a reduction of operations from 43 percent to 75 percent, depending on the symbol’s position in the constellation, achieving a reduction 47.25% reduction in total computational time for 100 K input symbols processed on 20 parallel cores compared to the KNN algorithm.


2006 ◽  
Vol 72 (6) ◽  
pp. 3924-3932 ◽  
Author(s):  
Erik Lys�e ◽  
Sonja S. Klemsdal ◽  
Karen R. Bone ◽  
Rasmus J. N. Frandsen ◽  
Thomas Johansen ◽  
...  

ABSTRACT Zearalenones are produced by several Fusarium species and can cause reproductive problems in animals. Some aurofusarin mutants of Fusarium pseudograminearum produce elevated levels of zearalenone (ZON), one of the estrogenic mycotoxins comprising the zearalenones. An analysis of transcripts from polyketide synthase genes identified in the Fusarium graminearum database was carried out for these mutants. PKS4 was the only gene with an enoyl reductase domain that had a higher level of transcription in the aurofusarin mutants than in the wild type. An Agrobacterium tumefaciens-mediated transformation protocol was used to replace the central part of the PKS4 gene with a hygB resistance gene through double homologous recombination in an F. graminearum strain producing a high level of ZON. PCR and Southern analysis of transformants were used to identify isolates with single insertional replacements of PKS4. High-performance liquid chromatography analysis showed that the PKS4 replacement mutant did not produce ZON. Thus, PKS4 encodes an enzyme required for the production of ZON in F. graminearum. Barley root infection studies revealed no alteration in the pathogenicity of the PKS4 mutant compared to the pathogenicity of the wild type. The expression of PKS13, which is located in the same cluster as PKS4, decreased dramatically in the mutant, while transcription of PKS4 was unchanged. This differential expression may indicate that ZON or its derivatives do not regulate expression of PKS4 and that the PKS4-encoded protein or its product stimulates expression of PKS13. Furthermore, both the lack of aurofusarin and ZON influenced the expression of other polyketide synthases, demonstrating that one polyketide can influence the expression of others.


2016 ◽  
Vol 256 ◽  
pp. 319-327 ◽  
Author(s):  
Mario Rosso ◽  
Ildiko Peter ◽  
Ivano Gattelli

During the last decades under the enthusiastic and competent guidance of Mr Chiarmetta SSM processes attained in Italy at Stampal Spa (Torino) an unquestionable high level of industrial development with the production of large numbers of high performance automotive parts, like variety of suspension support, engine suspension mounts, steering knuckle, front suspension wheel, arm and rear axle. Among the most highlighted findings SSM processes demonstrated their capability to reduce the existing gap between casting and forging, moreover during such a processes there are the opportunity to better control the defect level.Purpose of this paper is to highlight the research work and the SSM industrial production attained and developed by Mr G.L. Chiarmetta, as well as to give an overview concerning some alternative methods for the production of enhanced performance light alloys components for critical industrial applications and to present an analysis of a new rheocasting process suitable for the manufacturing of high performance industrial components.


Sign in / Sign up

Export Citation Format

Share Document