High Throughput Ultrasonic Multi-implant Readout Using a Machine-Learning Assisted CDMA Receiver

This review article presents the state-of-the-art in high-throughput computational and experimental screening routines with application in organic solar cells, including materials discovery, device optimization and machine-learning algorithms.

FRI0585 HIGH-THROUGHPUT METHODOLOGY FOR EMR-BASED IDENTIFICATION OF CLINICAL SUB-PHENOTYPES IN COMPLEX PATIENT POPULATIONS

Annals of the Rheumatic Diseases ◽

10.1136/annrheumdis-2020-eular.3489 ◽

2020 ◽

Vol 79 (Suppl 1) ◽

pp. 897.2-897

Author(s):

M. Maurits ◽

T. Huizinga ◽

M. Reinders ◽

S. Raychaudhuri ◽

E. Karlson ◽

...

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Dimensionality Reduction ◽

High Throughput ◽

Brain Cancer ◽

Machine Learning Techniques ◽

Summary Statistics ◽

Medical Problems ◽

Learning Techniques ◽

Icd Codes

Background: Heterogeneity in disease populations complicates discovery of risk factors. To identify risk factors for subpopulations of diseases, we need analytical methods that can deal with unidentified disease subgroups.

Objectives: Inspired by successful approaches from the Big Data field, we developed a high-throughput approach to identify subpopulations within patients with heterogeneous, complex diseases using the wealth of information available in Electronic Medical Records (EMRs).

Methods: We extracted longitudinal healthcare-interaction records coded by 1,853 PheCodes [1] of the 64,819 patients from Boston's Partners Biobank. Through dimensionality reduction using t-SNE [2] we created a 2D embedding of 32,424 of these patients (set A). We then identified distinct clusters post-t-SNE using DBSCAN [3] and visualized the relative importance of individual PheCodes within them using specialized spectrographs. We replicated this procedure in the remaining 32,395 records (set B).

Results: Summary statistics of both sets were comparable (Table 1).

Table 1. Summary statistics of the total Partners Biobank dataset and the 2 partitions.

                    Set A         Set B         Total
Entries             12,200,311    12,177,131    24,377,442
Patients            32,424        32,395        64,819
Patient years       369,546.33    368,597.92    738,144.2
Unique ICD codes    25,056        24,953        26,305
Unique PheCodes     1,851         1,853         1,853

We found 284 clusters in set A and 295 in set B, of which 63.4% from set A could be mapped to a cluster in set B with a median (range) correlation of 0.24 (0.03 – 0.58). Clusters represented similar yet distinct clinical phenotypes; e.g. patients diagnosed with “other headache syndrome” were separated into four distinct clusters characterized by migraines, neurofibromatosis, epilepsy or brain cancer, all resulting in patients presenting with headaches (Fig. 1 & 2). Though EMR databases tend to be noisy, our method was also able to differentiate misclassification from true cases; SLE patients with RA codes clustered separately from true RA cases.

Figure 1. Two dimensional representation of Set A generated using dimensionality reduction (t-SNE) and clustering (DBSCAN).

Figure 2. Phenotype Spectrographs (PheSpecs) of four clusters characterized by “Other headache syndromes”, driven by codes relating to migraine, epilepsy, neurofibromatosis or brain cancer.

Conclusion: We have shown that EMR data can be used to identify and visualize latent structure in patient categorizations, using an approach based on dimension reduction and clustering machine learning techniques. Our method can identify misclassified patients as well as separate patients with similar problems into subsets with different associated medical problems. Our approach adds a new and powerful tool to aid in the discovery of novel risk factors in complex, heterogeneous diseases.

References:
[1] Denny, J.C. et al. Bioinformatics (2010)
[2] van der Maaten et al. Journal of Machine Learning Research (2008)
[3] Ester, M. et al. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (1996)

Disclosure of Interests: Marc Maurits: None declared, Thomas Huizinga Grant/research support from: Ablynx, Bristol-Myers Squibb, Roche, Sanofi, Consultant of: Ablynx, Bristol-Myers Squibb, Roche, Sanofi, Marcel Reinders: None declared, Soumya Raychaudhuri: None declared, Elizabeth Karlson: None declared, Erik van den Akker: None declared, Rachel Knevel: None declared
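
The clustering step described in the Methods can be illustrated with a minimal, pure-Python sketch of density-based clustering in the style of DBSCAN [3]. It is not the authors' implementation: the toy 2D `embedding` stands in for a t-SNE output, and the values of `eps` and `min_samples` are assumptions chosen only so the example is easy to follow. The abstract does not report the parameters actually used.

```python
from math import dist

NOISE = -1  # label for points that belong to no cluster


def region_query(points, i, eps):
    """Indices of all points within eps of points[i] (including i itself)."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_samples):
    """Label each point with a cluster id (0, 1, ...) or NOISE."""
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue  # already visited
        neighbours = region_query(points, i, eps)
        if len(neighbours) < min_samples:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbours if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbours = region_query(points, j, eps)
            if len(j_neighbours) >= min_samples:
                queue.extend(j_neighbours)  # j is a core point: keep expanding
    return labels


# Hypothetical 2D embedding: two dense groups and one isolated point.
embedding = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),
    (5.0, 5.0), (5.1, 5.0), (5.0, 5.1), (5.1, 5.1),
    (10.0, 0.0),
]
labels = dbscan(embedding, eps=0.2, min_samples=3)
print(labels)
```

In this toy case the two dense groups receive separate cluster ids and the isolated point is marked as noise, which mirrors how distinct patient groups could be separated in an embedding.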

First-principles data integrated machine learning approach for high-throughput searching of ternary electrocatalyst toward oxygen reduction reaction

Chem Catalysis ◽

10.1016/j.checat.2021.06.001 ◽

2021 ◽

Author(s):

Hoje Chun ◽

Eunjik Lee ◽

Kyungju Nam ◽

Ji-Hoon Jang ◽

Woomin Kyoung ◽

...

Keyword(s):

Machine Learning ◽

Oxygen Reduction Reaction ◽

Oxygen Reduction ◽

High Throughput ◽

First Principles ◽

Reduction Reaction ◽

Learning Approach ◽

Machine Learning Approach

Machine learning methodology for high throughput personalized neutron dose reconstruction in mixed neutron + photon exposures

Scientific Reports ◽

10.1038/s41598-021-83575-5 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Igor Shuryak ◽

Helen C. Turner ◽

Monica Pujol-Canadell ◽

Jay R. Perrier ◽

Guy Garty ◽

...

Keyword(s):

Machine Learning ◽

High Throughput ◽

Mean Squared Error ◽

Ex Vivo ◽

Probability Distributions ◽

Micronucleus Assay ◽

Neutron Dose ◽

Cell Probability ◽

Scanning Imaging ◽

Automated Scanning

We implemented machine learning in the radiation biodosimetry field to quantitatively reconstruct neutron doses in mixed neutron + photon exposures, which are expected in improvised nuclear device detonations. Such individualized reconstructions are crucial for triage and treatment because neutrons are more biologically damaging than photons. We used a high-throughput micronucleus assay with automated scanning/imaging on lymphocytes from human blood ex-vivo irradiated with 44 different combinations of 0–4 Gy neutrons and 0–15 Gy photons (542 blood samples), which include reanalysis of past experiments. We developed several metrics that describe micronuclei/cell probability distributions in binucleated cells, and used them as predictors in random forest (RF) and XGBoost machine learning analyses to reconstruct the neutron dose in each sample. The probability of “overfitting” was minimized by training both algorithms with repeated cross-validation on a randomly-selected subset of the data, and measuring performance on the rest. RF achieved the best performance. Mean R² for actual vs. reconstructed neutron doses over 300 random training/testing splits was 0.869 (range 0.761 to 0.919) and root mean squared error was 0.239 (0.195 to 0.351) Gy. These results demonstrate the promising potential of machine learning to reconstruct the neutron dose component in clinically-relevant complex radiation exposure scenarios.
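
The evaluation scheme in this abstract (repeated random training/testing splits, scored by R² and root mean squared error) can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: a one-variable least-squares line stands in for the random forest and XGBoost models, the data are synthetic, the predictor `metric` is a hypothetical stand-in for the micronucleus-based metrics, and the 50/50 split is an arbitrary choice because the abstract does not give the split proportion.

```python
import random
from math import sqrt
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x (stand-in for RF/XGBoost)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    m = mean(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - m) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


def rmse(actual, predicted):
    """Root mean squared error, in the same units as the dose."""
    return sqrt(mean((a - p) ** 2 for a, p in zip(actual, predicted)))


# Synthetic data (assumption): neutron doses in 0-4 Gy and one noisy predictor.
rng = random.Random(0)
doses = [rng.uniform(0, 4) for _ in range(200)]
metric = [0.5 * d + rng.gauss(0, 0.2) for d in doses]

r2_scores, rmse_scores = [], []
for _ in range(300):  # 300 random training/testing splits, as in the abstract
    idx = list(range(len(doses)))
    rng.shuffle(idx)
    train, test = idx[:100], idx[100:]  # 50/50 split is an assumption
    a, b = fit_line([metric[i] for i in train], [doses[i] for i in train])
    actual = [doses[i] for i in test]
    predicted = [a + b * metric[i] for i in test]
    r2_scores.append(r_squared(actual, predicted))
    rmse_scores.append(rmse(actual, predicted))

print(f"mean R^2  = {mean(r2_scores):.3f} "
      f"(range {min(r2_scores):.3f} to {max(r2_scores):.3f})")
print(f"mean RMSE = {mean(rmse_scores):.3f} "
      f"(range {min(rmse_scores):.3f} to {max(rmse_scores):.3f}) Gy")
```

Reporting the mean together with the range across splits, as the abstract does, shows how sensitive the scores are to which samples happen to land in the test set.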

Accelerating the discovery of energetic melt-castable materials by a high-throughput virtual screening and experimental approach

Journal of Materials Chemistry A ◽

10.1039/d1ta04441a ◽

2021 ◽

Author(s):

Siwei Song ◽

Fang Chen ◽

Yi Wang ◽

Kangcai Wang ◽

Mi Yan ◽

...

Keyword(s):

Machine Learning ◽

Virtual Screening ◽

High Throughput ◽

Experimental Approach ◽

Chemical Data ◽

Research Paradigm ◽

New Materials ◽

High Throughput Virtual Screening

With the growth of chemical data, computation power and algorithms, machine learning-assisted high-throughput virtual screening (ML-assisted HTVS) is revolutionizing the research paradigm of new materials. Herein, a combined ML-assisted HTVS...

A deep neural network model for packing density predictions and its application in the study of 1.5 million organic molecules

Chemical Science ◽

10.1039/c9sc02677k ◽

2019 ◽

Vol 10 (36) ◽

pp. 8374-8383 ◽

Cited By ~ 1

Author(s):

Mohammad Atif Faiz Afzal ◽

Aditya Sonpal ◽

Mojtaba Haghighatlari ◽

Andrew J. Schultz ◽

Johannes Hachmann

Keyword(s):

Neural Network ◽

Machine Learning ◽

Refractive Index ◽

High Throughput ◽

Neural Network Model ◽

High Throughput Screening ◽

Deep Neural Network ◽

Organic Molecules ◽

High Refractive Index ◽

Computational Pipeline

Computational pipeline for the accelerated discovery of organic materials with high refractive index via high-throughput screening and machine learning.

Predicting carbon nanotube forest attributes and mechanical properties using simulated images and deep learning

npj Computational Materials ◽

10.1038/s41524-021-00603-8 ◽

2021 ◽

Vol 7 (1) ◽

Cited By ~ 1

Author(s):

Taher Hajilounezhad ◽

Rina Bao ◽

Kannappan Palaniappan ◽

Filiz Bunyak ◽

Prasad Calyam ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Carbon Nanotube ◽

High Throughput ◽

Self Assembly ◽

Mechanical Performance ◽

Physical Parameters ◽

Structure Property ◽

Processing Parameter ◽

Physics Based Simulation

Understanding and controlling the self-assembly of vertically oriented carbon nanotube (CNT) forests is essential for realizing their potential in myriad applications. The governing process–structure–property mechanisms are poorly understood, and the processing parameter space is far too vast to exhaustively explore experimentally. We overcome these limitations by using a physics-based simulation as a high-throughput virtual laboratory and image-based machine learning to relate CNT forest synthesis attributes to their mechanical performance. Using CNTNet, our image-based deep learning classifier module trained with synthetic imagery, combinations of CNT diameter, density, and population growth rate classes were labeled with an accuracy of >91%. The CNTNet regression module predicted CNT forest stiffness and buckling load properties with a lower root-mean-square error than that of a regression predictor based on CNT physical parameters. These results demonstrate that image-based machine learning trained using only simulated imagery can distinguish subtle CNT forest morphological features to predict physical material properties with high accuracy. CNTNet paves the way to incorporate scanning electron microscope imagery for high-throughput material discovery.

Applications of Metabolomics in Forensic Toxicology and Forensic Medicine

International Journal of Molecular Sciences ◽

10.3390/ijms22063010 ◽

2021 ◽

Vol 22 (6) ◽

pp. 3010

Author(s):

Michal Szeremeta ◽

Karolina Pietrowska ◽

Anna Niemcunowicz-Janica ◽

Adam Kretowski ◽

Michal Ciborowski

Keyword(s):

Machine Learning ◽

Multivariate Statistics ◽

High Throughput ◽

Forensic Medicine ◽

Forensic Toxicology ◽

Metabolic Changes ◽

Untargeted Metabolomics ◽

Criminal Cases ◽

Statistical Approaches

Forensic toxicology and forensic medicine are unique among all other medical fields because of their essential legal impact, especially in civil and criminal cases. New high-throughput technologies, borrowed from chemistry and physics, have proven that metabolomics, the youngest of the “omics sciences”, could be one of the most powerful tools for monitoring changes in forensic disciplines. Metabolomics is a particular method that allows for the measurement of metabolic changes in a multicellular system using two different approaches: targeted and untargeted. Targeted studies are focused on a known number of defined metabolites. Untargeted metabolomics aims to capture all metabolites present in a sample. Different statistical approaches (e.g., uni- or multivariate statistics, machine learning) can be applied to extract useful and important information in both cases. This review aims to describe the role of metabolomics in forensic toxicology and in forensic medicine.

High-throughput brain activity mapping and machine learning as a foundation for systems neuropharmacology

Nature Communications ◽

10.1038/s41467-018-07289-5 ◽

2018 ◽

Vol 9 (1) ◽

Cited By ~ 9

Author(s):

Xudong Lin ◽

Xin Duan ◽

Claire Jacobs ◽

Jeremy Ullmann ◽

Chung-Yuen Chan ◽

...

Keyword(s):

Machine Learning ◽

High Throughput ◽

Brain Activity

Machine Learning and High-Throughput Approaches to Magnetism

Handbook of Materials Modeling ◽

10.1007/978-3-319-44680-6_108 ◽

2020 ◽

pp. 351-373 ◽

Cited By ~ 1

Author(s):

Stefano Sanvito ◽

M. Žic ◽

J. Nelson ◽

T. Archer ◽

C. Oses ◽

...

Keyword(s):

Machine Learning ◽

High Throughput
