Fast Function Approximation with Hierarchical Neural Networks and Their Application to a Reinforcement Learning Agent

Location- and Person-Independent Activity Recognition with WiFi, Deep Neural Networks, and Reinforcement Learning

ACM Transactions on Internet of Things ◽

10.1145/3424739 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-25

Author(s):

Yongsen Ma ◽

Sheheryar Arshad ◽

Swetha Muniraju ◽

Eric Torkildson ◽

Enrico Rantala ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Reinforcement Learning ◽

Activity Recognition ◽

Deep Neural Networks ◽

State Machine ◽

Recognition Algorithm ◽

The State ◽

Neural Architecture ◽

Learning Agent

In recent years, Channel State Information (CSI) measured by WiFi is widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from history classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design has 97% average accuracy when testing devices and persons are not seen during training. The proposed design is also evaluated by two public datasets with accuracy of 80% and 83%. The proposed design needs very little human efforts for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.

Download Full-text

Cascade Attribute Network: Decomposing Reinforcement Learning Control Policies using Hierarchical Neural Networks

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2317 ◽

2020 ◽

Vol 53 (2) ◽

pp. 8181-8186

Author(s):

Haonan Chang ◽

Zhuo Xu ◽

Masayoshi Tomizuka

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Learning Control ◽

Control Policies ◽

Hierarchical Neural Networks

Download Full-text

Multilevel Data Classification and Function Approximation Using Hierarchical Neural Networks

Computational Methods for the Innovative Design of Electrical Devices - Studies in Computational Intelligence ◽

10.1007/978-3-642-16225-1_8 ◽

2010 ◽

pp. 147-166

Author(s):

M. Alper Selver ◽

Cüneyt Güzeliş

Keyword(s):

Neural Networks ◽

Function Approximation ◽

Data Classification ◽

Multilevel Data ◽

Hierarchical Neural Networks ◽

And Function

Download Full-text

Efficient Value Function Approximation with Unsupervised Hierarchical Categorization for a Reinforcement Learning Agent

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology ◽

10.1109/wi-iat.2010.16 ◽

2010 ◽

Cited By ~ 1

Author(s):

Yongjia Wang ◽

John E. Laird

Keyword(s):

Reinforcement Learning ◽

Function Approximation ◽

Value Function ◽

Value Function Approximation ◽

Learning Agent ◽

Hierarchical Categorization

Download Full-text

Hybrid Reinforcement Learning with Expert State Sequences

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33013739 ◽

2019 ◽

Vol 33 ◽

pp. 3739-3746 ◽

Cited By ~ 2

Author(s):

Xiaoxiao Guo ◽

Shiyu Chang ◽

Mo Yu ◽

Gerald Tesauro ◽

Murray Campbell

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Deep Neural Networks ◽

Hybrid Approach ◽

Imitation Learning ◽

Learning Approaches ◽

Inference Model ◽

Learning Agent ◽

Hybrid Reinforcement ◽

Policy Optimization

Existing imitation learning approaches often require that the complete demonstration data, including sequences of actions and states, are available. In this paper, we consider a more realistic and difficult scenario where a reinforcement learning agent only has access to the state sequences of an expert, while the expert actions are unobserved. We propose a novel tensor-based model to infer the unobserved actions of the expert state sequences. The policy of the agent is then optimized via a hybrid objective combining reinforcement learning and imitation learning. We evaluated our hybrid approach on an illustrative domain and Atari games. The empirical results show that (1) the agents are able to leverage state expert sequences to learn faster than pure reinforcement learning baselines, (2) our tensor-based action inference model is advantageous compared to standard deep neural networks in inferring expert actions, and (3) the hybrid policy optimization objective is robust against noise in expert state sequences.

Download Full-text

On the Use of Function Approximation Potential of Artificial Neural Networks for Predicting Elastic Moduli of Binary Oxide Glass Systems

Global Journal For Research Analysis ◽

10.15373/22778160/february2014/60 ◽

2012 ◽

Vol 3 (2) ◽

pp. 186-189

Author(s):

Dr.R.Sheelarani Dr.R.Sheelarani ◽

◽

Dr. K.T. Arulmozhi Dr. K.T. Arulmozhi

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Function Approximation ◽

Elastic Moduli ◽

Binary Oxide ◽

Oxide Glass ◽

Artificial Neural

Download Full-text

An Evaluation Methodology for Interactive Reinforcement Learning with Simulated Users

Biomimetics ◽

10.3390/biomimetics6010013 ◽

2021 ◽

Vol 6 (1) ◽

pp. 13

Author(s):

Adam Bignold ◽

Francisco Cruz ◽

Richard Dazeley ◽

Peter Vamplew ◽

Cameron Foale

Keyword(s):

Reinforcement Learning ◽

Information Source ◽

Human Interaction ◽

Evaluation Methodology ◽

External Information ◽

Preliminary Evaluation ◽

Learning Agents ◽

Learning Agent ◽

Knowledge Bias ◽

The Impact

Interactive reinforcement learning methods utilise an external information source to evaluate decisions and accelerate learning. Previous work has shown that human advice could significantly improve learning agents’ performance. When evaluating reinforcement learning algorithms, it is common to repeat experiments as parameters are altered or to gain a sufficient sample size. In this regard, to require human interaction every time an experiment is restarted is undesirable, particularly when the expense in doing so can be considerable. Additionally, reusing the same people for the experiment introduces bias, as they will learn the behaviour of the agent and the dynamics of the environment. This paper presents a methodology for evaluating interactive reinforcement learning agents by employing simulated users. Simulated users allow human knowledge, bias, and interaction to be simulated. The use of simulated users allows the development and testing of reinforcement learning agents, and can provide indicative results of agent performance under defined human constraints. While simulated users are no replacement for actual humans, they do offer an affordable and fast alternative for evaluative assisted agents. We introduce a method for performing a preliminary evaluation utilising simulated users to show how performance changes depending on the type of user assisting the agent. Moreover, we describe how human interaction may be simulated, and present an experiment illustrating the applicability of simulating users in evaluating agent performance when assisted by different types of trainers. Experimental results show that the use of this methodology allows for greater insight into the performance of interactive reinforcement learning agents when advised by different users. The use of simulated users with varying characteristics allows for evaluation of the impact of those characteristics on the behaviour of the learning agent.

Download Full-text

Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning

Algorithms ◽

10.3390/a14010026 ◽

2021 ◽

Vol 14 (1) ◽

pp. 26

Author(s):

Yiran Xue ◽

Rui Wu ◽

Jiafeng Liu ◽

Xianglong Tang

Keyword(s):

Reinforcement Learning ◽

Guidance System ◽

Force Model ◽

Interactive Simulation ◽

Social Force ◽

Novel Approach ◽

Learning Agent ◽

Network Output ◽

Combined Action ◽

Crowd Evacuation

Existing crowd evacuation guidance systems require the manual design of models and input parameters, incurring a significant workload and a potential for errors. This paper proposed an end-to-end intelligent evacuation guidance method based on deep reinforcement learning, and designed an interactive simulation environment based on the social force model. The agent could automatically learn a scene model and path planning strategy with only scene images as input, and directly output dynamic signage information. Aiming to solve the “dimension disaster” phenomenon of the deep Q network (DQN) algorithm in crowd evacuation, this paper proposed a combined action-space DQN (CA-DQN) algorithm that grouped Q network output layer nodes according to action dimensions, which significantly reduced the network complexity and improved system practicality in complex scenes. In this paper, the evacuation guidance system is defined as a reinforcement learning agent and implemented by the CA-DQN method, which provides a novel approach for the evacuation guidance problem. The experiments demonstrate that the proposed method is superior to the static guidance method, and on par with the manually designed model method.

Download Full-text

Optimal function approximation with ReLU neural networks

Neurocomputing ◽

10.1016/j.neucom.2021.01.007 ◽

2021 ◽

Vol 435 ◽

pp. 216-227

Author(s):

Bo Liu ◽

Yi Liang

Keyword(s):

Neural Networks ◽

Function Approximation ◽

Optimal Function

Download Full-text

Autonomous reinforcement learning agent for chemical vapor deposition synthesis of quantum materials

npj Computational Materials ◽

10.1038/s41524-021-00535-3 ◽

2021 ◽

Vol 7 (1) ◽

Author(s):

Pankaj Rajak ◽

Aravind Krishnamoorthy ◽

Ankit Mishra ◽

Rajiv Kalia ◽

Aiichiro Nakano ◽

...

Keyword(s):

Chemical Vapor Deposition ◽

Reinforcement Learning ◽

Vapor Deposition ◽

Chemical Vapor ◽

Time Behavior ◽

Materials Synthesis ◽

Design Synthesis ◽

Learning Agent ◽

Threshold Temperatures ◽

Quantum Materials

AbstractPredictive materials synthesis is the primary bottleneck in realizing functional and quantum materials. Strategies for synthesis of promising materials are currently identified by time-consuming trial and error and there are no known predictive schemes to design synthesis parameters for materials. We use offline reinforcement learning (RL) to predict optimal synthesis schedules, i.e., a time-sequence of reaction conditions like temperatures and concentrations, for the synthesis of semiconducting monolayer MoS2 using chemical vapor deposition. The RL agent, trained on 10,000 computational synthesis simulations, learned threshold temperatures and chemical potentials for onset of chemical reactions and predicted previously unknown synthesis schedules that produce well-sulfidized crystalline, phase-pure MoS2. The model can be extended to multi-task objectives such as predicting profiles for synthesis of complex structures including multi-phase heterostructures and can predict long-time behavior of reacting systems, far beyond the domain of molecular dynamics simulations, making these predictions directly relevant to experimental synthesis.

Download Full-text