Optimal stationary strategies in leavable Markov decision processes

Matthias Fassbender

doi:10.1017/s0021900200038481

Optimal stationary strategies in leavable Markov decision processes

Journal of Applied Probability ◽

10.1017/s0021900200038481 ◽

1990 ◽

Vol 27 (01) ◽

pp. 134-145

Author(s):

Matthias Fassbender

Keyword(s):

Transition Probabilities ◽

Practical Applications ◽

Stationary Strategies ◽

Countable State ◽

Markov Decision ◽

Bias Optimality ◽

The Mean ◽

Optimal Stationary Strategies ◽

The Cost ◽

Reward Criterion

This paper establishes the existence of an optimal stationary strategy in a leavable Markov decision process with countable state space and undiscounted total reward criterion. Besides assumptions of boundedness and continuity, an assumption is imposed on the model which demands the continuity of the mean recurrence times on a subset of the stationary strategies, the so-called ‘good strategies'. For practical applications it is important that this assumption is implied by an assumption about the cost structure and the transition probabilities. In the last part we point out that our results in general cannot be deduced from related works on bias-optimality by Dekker and Hordijk, Wijngaard or Mann.

Download Full-text

Optimal stationary strategies in leavable Markov decision processes

Journal of Applied Probability ◽

10.2307/3214601 ◽

1990 ◽

Vol 27 (1) ◽

pp. 134-145

Author(s):

Matthias Fassbender

Keyword(s):

Transition Probabilities ◽

Practical Applications ◽

Stationary Strategies ◽

Countable State ◽

Markov Decision ◽

Bias Optimality ◽

The Mean ◽

Optimal Stationary Strategies ◽

The Cost ◽

Reward Criterion

This paper establishes the existence of an optimal stationary strategy in a leavable Markov decision process with countable state space and undiscounted total reward criterion.Besides assumptions of boundedness and continuity, an assumption is imposed on the model which demands the continuity of the mean recurrence times on a subset of the stationary strategies, the so-called ‘good strategies'. For practical applications it is important that this assumption is implied by an assumption about the cost structure and the transition probabilities. In the last part we point out that our results in general cannot be deduced from related works on bias-optimality by Dekker and Hordijk, Wijngaard or Mann.

Download Full-text

Nearly Optimal Stationary Strategies for the Total Reward Markov Decision Process

DGOR ◽

10.1007/978-3-642-68118-9_94 ◽

1981 ◽

pp. 501-501

Author(s):

J. Wal

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Total Reward ◽

Stationary Strategies ◽

Markov Decision ◽

Optimal Stationary Strategies

Download Full-text

On Stationary Strategies in Countable State Total Reward Markov Decision Processes

Mathematics of Operations Research ◽

10.1287/moor.9.2.290 ◽

1984 ◽

Vol 9 (2) ◽

pp. 290-300 ◽

Cited By ~ 10

Author(s):

Jan van der Wal

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Total Reward ◽

Stationary Strategies ◽

Countable State ◽

Markov Decision

Download Full-text

Linear Programming and Zero-Sum Two-Person Undiscounted Semi-Markov Games

Asia Pacific Journal of Operational Research ◽

10.1142/s0217595915500438 ◽

2015 ◽

Vol 32 (06) ◽

pp. 1550043 ◽

Cited By ~ 6

Author(s):

Prasenjit Mondal

Keyword(s):

Linear Programming ◽

Transition Probabilities ◽

Optimal Solution ◽

Programming Algorithm ◽

Stationary Strategy ◽

Markov Games ◽

Markov Decision ◽

Transition Times ◽

Optimal Stationary Strategies ◽

Zero Sum

In this paper, zero-sum two-person finite undiscounted (limiting average) semi-Markov games (SMGs) are considered. We prove that the solutions of the game when both players are restricted to semi-Markov strategies are solutions for the original game. In addition, we show that if one player fixes a stationary strategy, then the other player can restrict himself in solving an undiscounted semi-Markov decision process associated with that stationary strategy. The undiscounted SMGs are also studied when the transition probabilities and the transition times are controlled by a fixed player in all states. If such games are unichain, we prove that the value and optimal stationary strategies of the players can be obtained from an optimal solution of a linear programming algorithm. We propose a realistic and generalized traveling inspection model that suitably fits into the class of one player control undiscounted unichain semi-Markov games.

Download Full-text

Design and development of manually operated gladiolus planter

Journal of AgriSearch ◽

10.21921/jas.v5i01.11135 ◽

2018 ◽

Vol 5 (01) ◽

Author(s):

TAPAN K. KHURA ◽

H. L. KUSHWAHA ◽

SATISH D LANDE ◽

PKSAHOO . ◽

INDRA L . KUSHWAHA

Keyword(s):

Coefficient Of Variation ◽

Field Capacity ◽

Power Source ◽

Forward Speed ◽

Cereal Crop ◽

Maximum Coefficient ◽

Coefficient Of Uniformity ◽

The Mean ◽

The Cost ◽

Immense Potential

Floriculture is an age-old farming activity in India having immense potential for generating selfemployment and income to farmers. However, the cost of cultivation of flower is high as compared to cereal crop. Level of mechanization for different field operations is one but foremost reason for the higher cost of cultivation. As most of the Indian farmers are marginal and small, a need for manually operated gladiolus planter was felt. The geometric properties of gladiolus corm were determined for designing the seed metering system and seed hopper of the planter. The planter was evaluated in the field when pulled by two persons as a power source and guided by a person. The coefficient of variation and highest deviation from the mean spacing was observed as 12.93% and 2.65cm respectively. The maximum coefficient of uniformity of 90.59% was observed for a nominal corm spacing of 15cm at 0.56 kmh-1 forward speed. An average MISS percentage was observed as 2.65 and 2.25 for nominal corm spacing of 15 and 20 cm. The multiple index was zero for two levels corm spacing and forward speed of operation. The QFI was found in the range of 97.2 and 97.9 percent. The average field capacity of the planter was observed as 0.02 hah-1.The average draft requirement of the planter was found as 821 ± 50.3 N.

Download Full-text

A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion

SIAM Journal on Control and Optimization ◽

10.1137/19m1255811 ◽

2020 ◽

Vol 58 (4) ◽

pp. 2535-2566

Author(s):

François Dufour ◽

Alexandre Genadot

Keyword(s):

Convex Programming ◽

Markov Decision Processes ◽

Discrete Time ◽

Decision Processes ◽

Programming Approach ◽

Total Reward ◽

Markov Decision ◽

Reward Criterion

Download Full-text

Sensuator: A Hybrid Sensor–Actuator Approach to Soft Robotic Proprioception Using Recurrent Neural Networks

Actuators ◽

10.3390/act10020030 ◽

2021 ◽

Vol 10 (2) ◽

pp. 30

Author(s):

Pornthep Preechayasomboon ◽

Eric Rombokas

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Linear Models ◽

Open Loop ◽

Proof Of Concept ◽

State Estimator ◽

Loop Control ◽

Practical Applications ◽

Soft Actuator ◽

The Cost

Soft robotic actuators are now being used in practical applications; however, they are often limited to open-loop control that relies on the inherent compliance of the actuator. Achieving human-like manipulation and grasping with soft robotic actuators requires at least some form of sensing, which often comes at the cost of complex fabrication and purposefully built sensor structures. In this paper, we utilize the actuating fluid itself as a sensing medium to achieve high-fidelity proprioception in a soft actuator. As our sensors are somewhat unstructured, their readings are difficult to interpret using linear models. We therefore present a proof of concept of a method for deriving the pose of the soft actuator using recurrent neural networks. We present the experimental setup and our learned state estimator to show that our method is viable for achieving proprioception and is also robust to common sensor failures.

Download Full-text

Ring-Forming Polymerization toward Perfluorocyclobutyl and Ortho-Diynylarene-Derived Materials: From Synthesis to Practical Applications

Materials ◽

10.3390/ma14061486 ◽

2021 ◽

Vol 14 (6) ◽

pp. 1486

Author(s):

Eugene B. Caldona ◽

Ernesto I. Borrego ◽

Ketki E. Shelar ◽

Karl M. Mukeba ◽

Dennis W. Smith

Keyword(s):

Chemical Resistance ◽

Structural Features ◽

Review Article ◽

Aryl Ether ◽

Aromatic Rings ◽

Practical Applications ◽

Repeat Units ◽

Wide Range ◽

Synthetic Routes ◽

The Cost

Many desirable characteristics of polymers arise from the method of polymerization and structural features of their repeat units, which typically are responsible for the polymer’s performance at the cost of processability. While linear alternatives are popular, polymers composed of cyclic repeat units across their backbones have generally been shown to exhibit higher optical transparency, lower water absorption, and higher glass transition temperatures. These specifically include polymers built with either substituted alicyclic structures or aromatic rings, or both. In this review article, we highlight two useful ring-forming polymer groups, perfluorocyclobutyl (PFCB) aryl ether polymers and ortho-diynylarene- (ODA) based thermosets, both demonstrating outstanding thermal stability, chemical resistance, mechanical integrity, and improved processability. Different synthetic routes (with emphasis on ring-forming polymerization) and properties for these polymers are discussed, followed by their relevant applications in a wide range of aspects.

Download Full-text

Seasonal Influenza and Low Flu Vaccination Coverage as Important Factors Modifying the Costs and Availability of Hospital Services in Poland: A Retrospective Comparative Study

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18105173 ◽

2021 ◽

Vol 18 (10) ◽

pp. 5173

Author(s):

Robert Susło ◽

Piotr Pobrotyn ◽

Lidia Brydak ◽

Łukasz Rypicz ◽

Urszula Grata-Borkowska ◽

...

Keyword(s):

Influenza Season ◽

Influenza Infection ◽

Hospital Staff ◽

University Hospital ◽

Control Group ◽

Admission Rates ◽

Risk Of Death ◽

The Mean ◽

Retrospective Comparative Study ◽

The Cost

Introduction: Influenza infection is associated with potential serious complications, increased hospitalization rates, and a higher risk of death. Materials and Methods: A retrospective comparative analysis of selected indicators of hospitalization from the University Hospital in Wroclaw, Poland, was carried out on patients with confirmed influenza infection in comparison to a control group randomly selected from among all other patients hospitalized on the respective wards during the 2018–2019 influenza season. Results: The mean laboratory testing costs for the entire hospital were 3.74-fold higher and the mean imaging test costs were 4.02-fold higher for patients with confirmed influenza than for the control group; the hospital expenses were additionally raised by the cost of antiviral therapy, which is striking when compared against the cost of a single flu vaccine. During the 2018–2019 influenza season, influenza infections among the hospital patients temporarily limited the healthcare service availability in the institution, which resulted in reduced admission rates to the departments related to internal medicine; the mean absence among the hospital staff totaled approximately 7 h per employee, despite 7.3% of the staff having been vaccinated against influenza at the hospital’s expense. Conclusions: There were significant differences in the hospitalization indicators between the patients with confirmed influenza and the control group, which markedly increased the hospital care costs in this multi-specialty university hospital.

Download Full-text

Robust analysis of discounted Markov decision processes with uncertain transition probabilities

Applied Mathematics-A Journal of Chinese Universities ◽

10.1007/s11766-020-3664-1 ◽

2020 ◽

Vol 35 (4) ◽

pp. 417-436

Author(s):

Zhen-kai Lou ◽

Fu-jun Hou ◽

Xu-ming Lou

Keyword(s):

Markov Decision Processes ◽

Transition Probabilities ◽

Decision Processes ◽

Robust Analysis ◽

Markov Decision

Download Full-text