mirror descent Latest Research Papers

Event-Triggered Distributed Stochastic Mirror Descent for Convex Optimization

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2021.3137010 ◽

2022 ◽

pp. 1-12

Author(s):

Menghui Xiong ◽

Baoyong Zhang ◽

Daniel W. C. Ho ◽

Deming Yuan ◽

Shengyuan Xu

Keyword(s):

Convex Optimization ◽

Mirror Descent ◽

Event Triggered

Download Full-text

Mirror descent algorithm on the indefinite control horizon

Journal of Physics Conference Series ◽

10.1088/1742-6596/2052/1/012039 ◽

2021 ◽

Vol 2052 (1) ◽

pp. 012039

Author(s):

D N Shiyan ◽

A V Kolnogorov

Keyword(s):

Optimal Control ◽

Data Processing ◽

Random Environment ◽

Optimal Strategy ◽

Control Algorithm ◽

Batch Processing ◽

Significant Gain ◽

Descent Algorithm ◽

Mirror Descent ◽

Classical Statement

Abstract We consider the problem of optimal control in a random environment in a minimax setting as applied to data processing. It is assumed that the random environment provides two methods of data processing, the effectiveness of which is not known in advance. The goal of the control in this case is to find the optimal strategy for the application of processing methods and to minimize losses. To solve this problem, the mirror descent algorithm is used, including its modifications for batch processing. The use of algorithms for batch processing allows us to get a significant gain in speed due to the parallel processing of batches. In the classical statement, the search for the optimal strategy is considered on a fixed control horizon but this article considers an indefinite control horizon. With an indefinite horizon, the control algorithm cannot use information about the value of the horizon when searching for an optimal strategy. Using numerical modeling, the operation of the mirror descent algorithm and its modifications on an indefinite control horizon is studied and obtained results are presented.

Download Full-text

Distributed Mirror Descent With Integral Feedback: Asymptotic Convergence Analysis of Continuous-Time Dynamics

IEEE Control Systems Letters ◽

10.1109/lcsys.2020.3040934 ◽

2021 ◽

Vol 5 (5) ◽

pp. 1507-1512

Author(s):

Youbang Sun ◽

Shahin Shahrampour

Keyword(s):

Convergence Analysis ◽

Continuous Time ◽

Asymptotic Convergence ◽

Time Dynamics ◽

Mirror Descent ◽

Integral Feedback

Download Full-text

Mirror Descent and the MultiplicativeWeights Update

10.1017/9781108699211.009 ◽

2021 ◽

pp. 108-142

Keyword(s):

Mirror Descent

Download Full-text

Two-armed bandit problem and batch version of the mirror descent algorithm

Mathematical Game Theory and Applications ◽

10.17076/mgta_2021_2_34 ◽

2021 ◽

Vol 13 (2) ◽

pp. 9-39

Author(s):

Александр Валерианович Колногоров ◽

Alexander Kolnogorov ◽

Александр Викторович Назин ◽

Alexander Nazin ◽

Дмитрий Николаевич Шиян ◽

...

Keyword(s):

Data Processing ◽

A Priori ◽

Control Performance ◽

Bandit Problem ◽

Minimax Risk ◽

Descent Algorithm ◽

Alternative Processing ◽

Mirror Descent ◽

Processing Data ◽

Parallel Data

We consider the minimax setup for the two-armed bandit problem as applied to data processing if there are two alternative processing methods with different a priori unknown efficiencies. One should determine the most efficient method and provide its predominant application. To this end, we use the mirror descent algorithm (MDA). It is well-known that corresponding minimax risk has the order of $N^{1/2$ with $N$ being the number of processed data and this bound is unimprovable in order. We propose a batch version of the MDA which allows processing data by packets that is especially important if parallel data processing can be provided. In this case, the processing time is determined by the number of batches rather than by the total number of data. Unexpectedly, it turned out that the batch version behaves unlike the ordinary one even if the number of packets is large. Moreover, the batch version provides significantly smaller value of the minimax risk, i.e., it considerably improves a control performance. We explain this result by considering another batch modification of the MDA which behavior is close to behavior of the ordinary version and minimax risk is close as well. Our estimates use invariant descriptions of the algorithms based on Gaussian approximations of incomes in batches of data in the domain of ``close'' distributions and are obtained by Monte-Carlo simulations.

Download Full-text

Algorithms for Convex Optimization

10.1017/9781108699211 ◽

2021 ◽

Author(s):

Nisheeth K. Vishnoi

Keyword(s):

Convex Optimization ◽

Data Science ◽

Optimization Problems ◽

Algorithm Design ◽

Continuous Optimization ◽

Maximum Flow ◽

Maximum Matching ◽

Science Data ◽

Mirror Descent ◽

Continuous Optimization Problems

In the last few years, Algorithms for Convex Optimization have revolutionized algorithm design, both for discrete and continuous optimization problems. For problems like maximum flow, maximum matching, and submodular function minimization, the fastest algorithms involve essential methods such as gradient descent, mirror descent, interior point methods, and ellipsoid methods. The goal of this self-contained book is to enable researchers and professionals in computer science, data science, and machine learning to gain an in-depth understanding of these algorithms. The text emphasizes how to derive key algorithms for convex optimization from first principles and how to establish precise running time bounds. This modern text explains the success of these algorithms in problems of discrete optimization, as well as how these methods have significantly pushed the state of the art of convex optimization itself.

Download Full-text

Reinforcement learning with constraint based on mirror descent algorithm

Results in Control and Optimization ◽

10.1016/j.rico.2021.100048 ◽

2021 ◽

pp. 100048

Author(s):

Megumi Miyashita ◽

Toshiyuki Kondo ◽

Shiro Yano

Keyword(s):

Reinforcement Learning ◽

Descent Algorithm ◽

Mirror Descent

Download Full-text

Differentially-Private Federated Learning with Long-Term Constraints Using Online Mirror Descent

10.1109/isit45174.2021.9518177 ◽

2021 ◽

Author(s):

Olusola Odeyomi ◽

Gergely Zaruba

Keyword(s):

Mirror Descent

Download Full-text

Fastest rates for stochastic mirror descent methods

Computational Optimization and Applications ◽

10.1007/s10589-021-00284-5 ◽

2021 ◽

Author(s):

Filip Hanzely ◽

Peter Richtárik

Keyword(s):

Descent Methods ◽

Mirror Descent

Download Full-text

Fiber-Sampled Stochastic Mirror Descent for Tensor Decomposition with β-Divergence

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9413830 ◽

2021 ◽

Author(s):

Wenqiang Pu ◽

Shahana Ibrahim ◽

Xiao Fu ◽

Mingyi Hong

Keyword(s):

Tensor Decomposition ◽

Mirror Descent

Download Full-text

mirror descent
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Event-Triggered Distributed Stochastic Mirror Descent for Convex Optimization

Mirror descent algorithm on the indefinite control horizon

Distributed Mirror Descent With Integral Feedback: Asymptotic Convergence Analysis of Continuous-Time Dynamics

Mirror Descent and the MultiplicativeWeights Update

Two-armed bandit problem and batch version of the mirror descent algorithm

Algorithms for Convex Optimization

Reinforcement learning with constraint based on mirror descent algorithm

Differentially-Private Federated Learning with Long-Term Constraints Using Online Mirror Descent

Fastest rates for stochastic mirror descent methods

Fiber-Sampled Stochastic Mirror Descent for Tensor Decomposition with β-Divergence

Export Citation Format

mirror descentRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Event-Triggered Distributed Stochastic Mirror Descent for Convex Optimization

Mirror descent algorithm on the indefinite control horizon

Distributed Mirror Descent With Integral Feedback: Asymptotic Convergence Analysis of Continuous-Time Dynamics

Mirror Descent and the MultiplicativeWeights Update

Two-armed bandit problem and batch version of the mirror descent algorithm

Algorithms for Convex Optimization

Reinforcement learning with constraint based on mirror descent algorithm

Differentially-Private Federated Learning with Long-Term Constraints Using Online Mirror Descent

Fastest rates for stochastic mirror descent methods

Fiber-Sampled Stochastic Mirror Descent for Tensor Decomposition with β-Divergence

mirror descent
Recently Published Documents