Optimal stationary strategies in leavable Markov decision processes

1990 ◽  
Vol 27 (01) ◽  
pp. 134-145
Author(s):  
Matthias Fassbender

This paper establishes the existence of an optimal stationary strategy in a leavable Markov decision process with countable state space and undiscounted total reward criterion. Besides assumptions of boundedness and continuity, an assumption is imposed on the model which demands the continuity of the mean recurrence times on a subset of the stationary strategies, the so-called ‘good strategies'. For practical applications it is important that this assumption is implied by an assumption about the cost structure and the transition probabilities. In the last part we point out that our results in general cannot be deduced from related works on bias-optimality by Dekker and Hordijk, Wijngaard or Mann.

1990 ◽  
Vol 27 (1) ◽  
pp. 134-145
Author(s):  
Matthias Fassbender

This paper establishes the existence of an optimal stationary strategy in a leavable Markov decision process with countable state space and undiscounted total reward criterion.Besides assumptions of boundedness and continuity, an assumption is imposed on the model which demands the continuity of the mean recurrence times on a subset of the stationary strategies, the so-called ‘good strategies'. For practical applications it is important that this assumption is implied by an assumption about the cost structure and the transition probabilities. In the last part we point out that our results in general cannot be deduced from related works on bias-optimality by Dekker and Hordijk, Wijngaard or Mann.


2015 ◽  
Vol 32 (06) ◽  
pp. 1550043 ◽  
Author(s):  
Prasenjit Mondal

In this paper, zero-sum two-person finite undiscounted (limiting average) semi-Markov games (SMGs) are considered. We prove that the solutions of the game when both players are restricted to semi-Markov strategies are solutions for the original game. In addition, we show that if one player fixes a stationary strategy, then the other player can restrict himself in solving an undiscounted semi-Markov decision process associated with that stationary strategy. The undiscounted SMGs are also studied when the transition probabilities and the transition times are controlled by a fixed player in all states. If such games are unichain, we prove that the value and optimal stationary strategies of the players can be obtained from an optimal solution of a linear programming algorithm. We propose a realistic and generalized traveling inspection model that suitably fits into the class of one player control undiscounted unichain semi-Markov games.


2018 ◽  
Vol 5 (01) ◽  
Author(s):  
TAPAN K. KHURA ◽  
H. L. KUSHWAHA ◽  
SATISH D LANDE ◽  
PKSAHOO . ◽  
INDRA L . KUSHWAHA

Floriculture is an age-old farming activity in India having immense potential for generating selfemployment and income to farmers. However, the cost of cultivation of flower is high as compared to cereal crop. Level of mechanization for different field operations is one but foremost reason for the higher cost of cultivation. As most of the Indian farmers are marginal and small, a need for manually operated gladiolus planter was felt. The geometric properties of gladiolus corm were determined for designing the seed metering system and seed hopper of the planter. The planter was evaluated in the field when pulled by two persons as a power source and guided by a person. The coefficient of variation and highest deviation from the mean spacing was observed as 12.93% and 2.65cm respectively. The maximum coefficient of uniformity of 90.59% was observed for a nominal corm spacing of 15cm at 0.56 kmh-1 forward speed. An average MISS percentage was observed as 2.65 and 2.25 for nominal corm spacing of 15 and 20 cm. The multiple index was zero for two levels corm spacing and forward speed of operation. The QFI was found in the range of 97.2 and 97.9 percent. The average field capacity of the planter was observed as 0.02 hah-1.The average draft requirement of the planter was found as 821 ± 50.3 N.


Actuators ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 30
Author(s):  
Pornthep Preechayasomboon ◽  
Eric Rombokas

Soft robotic actuators are now being used in practical applications; however, they are often limited to open-loop control that relies on the inherent compliance of the actuator. Achieving human-like manipulation and grasping with soft robotic actuators requires at least some form of sensing, which often comes at the cost of complex fabrication and purposefully built sensor structures. In this paper, we utilize the actuating fluid itself as a sensing medium to achieve high-fidelity proprioception in a soft actuator. As our sensors are somewhat unstructured, their readings are difficult to interpret using linear models. We therefore present a proof of concept of a method for deriving the pose of the soft actuator using recurrent neural networks. We present the experimental setup and our learned state estimator to show that our method is viable for achieving proprioception and is also robust to common sensor failures.


Materials ◽  
2021 ◽  
Vol 14 (6) ◽  
pp. 1486
Author(s):  
Eugene B. Caldona ◽  
Ernesto I. Borrego ◽  
Ketki E. Shelar ◽  
Karl M. Mukeba ◽  
Dennis W. Smith

Many desirable characteristics of polymers arise from the method of polymerization and structural features of their repeat units, which typically are responsible for the polymer’s performance at the cost of processability. While linear alternatives are popular, polymers composed of cyclic repeat units across their backbones have generally been shown to exhibit higher optical transparency, lower water absorption, and higher glass transition temperatures. These specifically include polymers built with either substituted alicyclic structures or aromatic rings, or both. In this review article, we highlight two useful ring-forming polymer groups, perfluorocyclobutyl (PFCB) aryl ether polymers and ortho-diynylarene- (ODA) based thermosets, both demonstrating outstanding thermal stability, chemical resistance, mechanical integrity, and improved processability. Different synthetic routes (with emphasis on ring-forming polymerization) and properties for these polymers are discussed, followed by their relevant applications in a wide range of aspects.


Author(s):  
Robert Susło ◽  
Piotr Pobrotyn ◽  
Lidia Brydak ◽  
Łukasz Rypicz ◽  
Urszula Grata-Borkowska ◽  
...  

Introduction: Influenza infection is associated with potential serious complications, increased hospitalization rates, and a higher risk of death. Materials and Methods: A retrospective comparative analysis of selected indicators of hospitalization from the University Hospital in Wroclaw, Poland, was carried out on patients with confirmed influenza infection in comparison to a control group randomly selected from among all other patients hospitalized on the respective wards during the 2018–2019 influenza season. Results: The mean laboratory testing costs for the entire hospital were 3.74-fold higher and the mean imaging test costs were 4.02-fold higher for patients with confirmed influenza than for the control group; the hospital expenses were additionally raised by the cost of antiviral therapy, which is striking when compared against the cost of a single flu vaccine. During the 2018–2019 influenza season, influenza infections among the hospital patients temporarily limited the healthcare service availability in the institution, which resulted in reduced admission rates to the departments related to internal medicine; the mean absence among the hospital staff totaled approximately 7 h per employee, despite 7.3% of the staff having been vaccinated against influenza at the hospital’s expense. Conclusions: There were significant differences in the hospitalization indicators between the patients with confirmed influenza and the control group, which markedly increased the hospital care costs in this multi-specialty university hospital.


Sign in / Sign up

Export Citation Format

Share Document