Black Box Models and Sociological Explanations: Predicting GPA Using Neural Networks

2017 ◽  
Author(s):  
Thomas Davidson

The Fragile Families Challenge provided an opportunity to empirically assess the applicability of black box machine learning models to sociological questions and the extent to which interpretable explanations can be extracted from these models. In this paper I use neural network models to predict high school grade-point average and examine how variations of basic network parameters affect predictive performance. Using a recently proposed technique, I identify the most important predictive variables used by the best-performing model, finding that they relate to parenting and the child’s cognitive and behavioral development, consistent with prior work. I conclude by discussing the implications of these findings for the relationship between prediction and explanation in sociological analyses.

2019 ◽  
Vol 5 ◽  
pp. 237802311881770 ◽  
Author(s):  
Thomas Davidson

The Fragile Families Challenge provided an opportunity to empirically assess the applicability of black-box machine learning models to sociological questions and the extent to which interpretable explanations can be extracted from these models. In this article the author uses neural network models to predict high school grade point average and examines how variations of basic network parameters affect predictive performance. Using a recently proposed technique, the author identifies the most important predictive variables used by the best-performing model, finding that they relate to parenting and the child’s cognitive and behavioral development, consistent with prior work. The author concludes by discussing the implications of these findings for the relationship between prediction and explanation in sociological analyses.
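The abstract does not name the importance technique it applies, so the following is only a hedged NumPy sketch of one common approach, permutation importance: shuffle a single input column and measure how much predictive performance degrades. A fixed linear function stands in for the trained neural network, and all data and names here are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: two informative predictors, one pure-noise predictor.
X = rng.normal(size=(200, 3))
y = 2.0 * X[:, 0] + 1.0 * X[:, 1] + 0.1 * rng.normal(size=200)

def model(X):
    """Stand-in "black box"; in the paper this would be the trained network."""
    return 2.0 * X[:, 0] + 1.0 * X[:, 1]

def permutation_importance(model, X, y, n_repeats=20, seed=1):
    """MSE increase when one input column is shuffled, averaged over repeats."""
    rng = np.random.default_rng(seed)
    base_mse = np.mean((model(X) - y) ** 2)
    importances = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        scores = []
        for _ in range(n_repeats):
            Xp = X.copy()
            Xp[:, j] = rng.permutation(Xp[:, j])  # break only feature j
            scores.append(np.mean((model(Xp) - y) ** 2) - base_mse)
        importances[j] = np.mean(scores)
    return importances

imp = permutation_importance(model, X, y)
print(imp)  # feature 0 dominates; the noise feature scores near zero
```

The ranking, not the raw scores, is what carries the sociological interpretation: variables whose shuffling hurts prediction most are the model's key predictors.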


2018 ◽  
Vol 6 (11) ◽  
pp. 216-216 ◽  
Author(s):  
Zhongheng Zhang ◽  
Marcus W. Beck ◽  
David A. Winkler ◽  
Bin Huang ◽  
...  

Author(s):  
Luca Pasa ◽  
Nicolò Navarin ◽  
Alessandro Sperduti

Graph property prediction is becoming increasingly popular due to the increasing availability of scientific and social data naturally represented in graph form. As a result, many researchers are focusing on the development of improved graph neural network models. One of the main components of a graph neural network is the aggregation operator, needed to generate a graph-level representation from a set of node-level embeddings. The aggregation operator is critical since it should, in principle, provide a representation of the graph that is isomorphism invariant, i.e. the graph representation should be a function of the graph's nodes treated as a set. DeepSets (in: Advances in neural information processing systems, pp 3391–3401, 2017) provides a framework for constructing a set-aggregation operator with universal approximation properties. In this paper, we propose a DeepSets aggregation operator, based on Self-Organizing Maps (SOM), to transform a set of node-level representations into a single graph-level one. The adoption of SOMs makes it possible to compute node representations that embed information about their mutual similarity. Experimental results on several real-world datasets show that our proposed approach achieves improved predictive performance compared to the commonly adopted sum aggregation and many state-of-the-art graph neural network architectures in the literature.
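As a rough illustration of the DeepSets idea this abstract builds on (not the authors' SOM-based operator), the NumPy sketch below implements the readout rho(sum_i phi(h_i)) and verifies that it is invariant to node ordering. The weight matrices are arbitrary placeholders, not learned parameters.

```python
import numpy as np

rng = np.random.default_rng(0)

# Placeholder weights for phi (node-level map) and rho (graph-level map).
W_phi = rng.normal(size=(4, 8))
W_rho = rng.normal(size=(8, 2))

def phi(H):
    """Node-wise map: applied to each node embedding independently."""
    return np.tanh(H @ W_phi)

def rho(s):
    """Post-aggregation map producing the graph-level representation."""
    return s @ W_rho

def deepsets_readout(H):
    """DeepSets readout rho(sum_i phi(h_i)); a function of the node *set*."""
    return rho(phi(H).sum(axis=0))

H = rng.normal(size=(5, 4))      # 5 nodes with 4-dim embeddings
perm = rng.permutation(5)
g1 = deepsets_readout(H)
g2 = deepsets_readout(H[perm])   # same nodes, different order
print(np.allclose(g1, g2))  # True
```

The sum in the middle is exactly the "commonly adopted sum aggregation" the abstract compares against; the paper's contribution replaces this stage with a SOM-based variant.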


2020 ◽  
Vol 172 ◽  
pp. 02010
Author(s):  
Louise Rævdal Lund Christensen ◽  
Thea Hauge Broholt ◽  
Michael Dahl Knudsen ◽  
Rasmus Elbæk Hedegaard ◽  
Steffen Petersen

Previous studies have identified significant potential in using economic model predictive control for space heating. This type of control requires a thermodynamic model of the controlled building that maps certain controllable inputs (heat power) and measured disturbances (ambient temperature and solar irradiation) to the controlled output variable (room temperature). Occupancy-related disturbances, such as people heat gains and venting through windows, are often completely ignored or assumed to be fully known (measured) in these studies. However, this assumption is usually not fulfilled in practice, and the current simulation study investigated the consequences thereof. The results indicate that the predictive performance (root mean square errors) of a black-box state-space model is not significantly affected by ignoring people heat gains. On the other hand, the predictive performance was significantly improved by including window opening status as a model input. The performance of black-box models for MPC of space heating could therefore benefit from inputs from sensors that track window opening.
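A minimal sketch of the kind of discrete-time black-box state-space model described here, x[k+1] = A x[k] + B u[k], y[k] = C x[k], where u stacks heat power, ambient temperature, solar irradiation, and a 0/1 window-opening status. The matrices are arbitrary placeholders (not identified from data), chosen only so that opening the window visibly shifts the rollout.

```python
import numpy as np

# Illustrative model matrices: one thermal state, four inputs.
A = np.array([[0.95]])                     # slow thermal dynamics
B = np.array([[0.02, 0.01, 0.005, -0.3]])  # window opening cools the state
C = np.array([[1.0]])

def simulate(x0, U):
    """One-step-ahead rollout over an input sequence U (steps x 4)."""
    x, ys = x0.copy(), []
    for u in U:
        ys.append(float((C @ x)[0, 0]))
        x = A @ x + B @ u.reshape(-1, 1)
    return np.array(ys)

x0 = np.array([[21.0]])                    # initial room-temperature state
base = np.array([500.0, 5.0, 100.0, 0.0])  # heat, ambient, solar, window closed
U_closed = np.tile(base, (10, 1))
U_open = U_closed.copy()
U_open[5:, 3] = 1.0                        # window opened from step 5 onward

y_closed = simulate(x0, U_closed)
y_open = simulate(x0, U_open)
print(y_closed[-1] > y_open[-1])  # True
```

A model lacking the fourth input column would be forced to explain the window-opening dip as noise, which is the degradation in predictive performance the study measures.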


2020 ◽  
Vol 34 (04) ◽  
pp. 4264-4271
Author(s):  
Siddhartha Jain ◽  
Ge Liu ◽  
Jonas Mueller ◽  
David Gifford

The inaccuracy of neural network models on inputs that do not stem from the distribution underlying the training data is problematic and at times unrecognized. Uncertainty estimates of model predictions are often based on the variation in predictions produced by a diverse ensemble of models applied to the same input. Here we describe Maximize Overall Diversity (MOD), an approach to improve ensemble-based uncertainty estimates by encouraging larger overall diversity in ensemble predictions across all possible inputs. We apply MOD to regression tasks including 38 Protein-DNA binding datasets, 9 UCI datasets, and the IMDB-Wiki image dataset. We also explore variants that utilize adversarial training techniques and data density estimation. For out-of-distribution test examples, MOD significantly improves predictive performance and uncertainty calibration without sacrificing performance on test data drawn from the same distribution as the training data. We also find that in Bayesian optimization tasks, the performance of UCB acquisition is improved via MOD uncertainty estimates.
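The ensemble-based uncertainty estimation that MOD improves upon (this sketch shows the baseline, not MOD itself) can be illustrated with a bootstrap ensemble of toy linear models: the ensemble mean is the prediction and the spread of member predictions is the uncertainty. The data and models are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D regression data in [-1, 1].
X = rng.uniform(-1, 1, size=60)
y = 3.0 * X + 0.1 * rng.normal(size=60)

def fit_member(seed):
    """Fit one ensemble member on a bootstrap resample (line through origin)."""
    r = np.random.default_rng(seed)
    idx = r.integers(0, len(X), len(X))
    xb, yb = X[idx], y[idx]
    return (xb * yb).sum() / (xb * xb).sum()  # least-squares slope

slopes = np.array([fit_member(s) for s in range(10)])

def predict_with_uncertainty(x):
    """Ensemble mean as the prediction, ensemble std as the uncertainty."""
    preds = slopes * x
    return preds.mean(), preds.std()

_, u_in = predict_with_uncertainty(0.5)   # inside the training region
_, u_out = predict_with_uncertainty(5.0)  # far outside it
print(u_out > u_in)  # True
```

MOD's contribution is to train the members so this spread is large across *all* possible inputs, not just where bootstrap noise happens to make the members disagree.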


2003 ◽  
Vol 3 ◽  
pp. 455-476 ◽  
Author(s):  
Wun Wong ◽  
Peter J. Fos ◽  
Frederick E. Petry

The assessment of medical outcomes is important in the effort to contain costs, streamline patient management, and codify medical practices. As such, it is necessary to develop predictive models that will make accurate predictions of these outcomes. The neural network methodology has often been shown to perform as well as, if not better than, the logistic regression methodology in terms of sample predictive performance. However, the logistic regression method is capable of providing an explanation regarding the relationship(s) between variables. This explanation is often crucial to understanding the clinical underpinnings of the disease process. Given the respective strengths of the methodologies in question, the combined use of statistical (i.e., logistic regression) and machine learning (i.e., neural network) technologies in the classification of medical outcomes is warranted under appropriate conditions. The study discusses these conditions and describes an approach for combining the strengths of the models.
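As a sketch of the interpretable half of this pairing, here is a plain gradient-descent logistic regression whose fitted coefficients recover which variable drives a synthetic binary outcome. The data, coefficients, and function names are invented for illustration; real clinical work would use a vetted statistics package.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic outcome data: feature 0 truly raises the odds, feature 1 is noise.
X = rng.normal(size=(300, 2))
true_logits = 1.5 * X[:, 0] - 0.2
y = (rng.uniform(size=300) < 1.0 / (1.0 + np.exp(-true_logits))).astype(float)

def fit_logistic(X, y, lr=0.1, steps=2000):
    """Gradient descent on the logistic log-loss; returns (weights, bias)."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # predicted probabilities
        w -= lr * (X.T @ (p - y)) / len(y)
        b -= lr * (p - y).mean()
    return w, b

w, b = fit_logistic(X, y)
print(w)  # w[0] clearly nonzero, w[1] near zero: the "explanation"
```

Each coefficient is a log-odds ratio, which is exactly the kind of variable-level explanation the abstract says a neural network alone does not provide.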


2018 ◽  
Vol 27 (3) ◽  
pp. 413-431
Author(s):  
M.A. Jayaram ◽  
T.M. Kiran Kumar ◽  
H.V. Raghavendra

Software project effort estimation is one of the important aspects of software engineering, and producing the best predictive model remains a great challenge for researchers in this area. In this work, the effort estimation for small-scale visualization projects covering engineering, general science, and other allied areas, developed by 60 postgraduate students in a supervised academic setting, is modeled by three approaches, namely, linear regression, quadratic regression, and neural network. Seven unique parameters, namely, number of lines of code (LOC), new and change code (N&C), reuse code (R), cumulative grade point average (CGPA), cyclomatic complexity (CC), algorithmic complexity (AC), and function points (FP), which are considered to be influential in software development effort, are elicited along with actual effort. The three models are compared with respect to their prediction accuracy via the magnitude of error relative to the estimate (MER) for each project and also its mean MER (MMER) over all the projects in both the verification and validation phases. Evaluations of the models have shown MMER of 0.002, 0.006, and 0.009 during verification and 0.006, 0.002, and 0.002 during validation for the multiple linear regression, nonlinear regression, and neural network models, respectively. Thus, the marginal differences in the error estimates have indicated that the three models can be used interchangeably for effort computation specific to visualization projects. Results have also suggested that parameters such as LOC, N&C, R, CC, and AC have a direct influence on effort prediction, whereas CGPA has an inverse relationship. FP seems to be neutral as far as visualization projects are concerned.
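Assuming the standard definition of MER as the absolute error divided by the estimate (the abstract names the metric but does not restate the formula), the MMER figures above can be computed as follows; the effort values are invented for illustration.

```python
import numpy as np

def mer(actual, predicted):
    """Magnitude of error relative to the estimate: |actual - predicted| / predicted."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return np.abs(actual - predicted) / predicted

def mmer(actual, predicted):
    """Mean MER across all projects."""
    return mer(actual, predicted).mean()

# Illustrative effort values (e.g., person-hours) for three projects.
actual = [100.0, 80.0, 120.0]
predicted = [90.0, 80.0, 132.0]
print(round(mmer(actual, predicted), 4))  # 0.0673
```

Because the denominator is the estimate rather than the actual effort, MER penalizes over- and under-estimation asymmetrically, which is worth keeping in mind when comparing the per-model MMER values above.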

