A regression model for pooled data in a two-stage survey under informative sampling with application for detecting and estimating the presence of transgenic corn

AbstractGroup-testing regression methods are effective for estimating and classifying binary responses and can substantially reduce the number of required diagnostic tests. However, there is no appropriate methodology when the sampling process is complex and informative. In these cases, researchers often ignore stratification and weights that can severely bias the estimates of the population parameters. In this paper, we develop group-testing regression models for analysing two-stage surveys with unequal selection probabilities and informative sampling. Weights are incorporated into the likelihood function using the pseudo-likelihood approach. A simulation study demonstrates that the proposed model reduces the bias in estimation considerably compared to other methods that ignore the weights. Finally, we apply the model for estimating the presence of transgenic corn in Mexico and we give the SAS code used for the analysis.

Download Full-text

Diversity Balancing for Two-Stage Collaborative Filtering in Recommender Systems

Applied Sciences ◽

10.3390/app10041257 ◽

2020 ◽

Vol 10 (4) ◽

pp. 1257 ◽

Cited By ~ 2

Author(s):

Liang Zhang ◽

Quanshen Wei ◽

Lei Zhang ◽

Baojiao Wang ◽

Wen-Hsien Ho

Keyword(s):

Collaborative Filtering ◽

Recommender Systems ◽

Social Relationships ◽

Two Stage ◽

Proposed Model ◽

The Social ◽

Item List ◽

Ranking Strategy ◽

High Prediction ◽

Optimization Mechanism

Conventional recommender systems are designed to achieve high prediction accuracy by recommending items expected to be the most relevant and interesting to users. Therefore, they tend to recommend only the most popular items. Studies agree that diversity of recommendations is as important as accuracy because it improves the customer experience by reducing monotony. However, increasing diversity reduces accuracy. Thus, a recommendation algorithm is needed to recommend less popular items while maintaining acceptable accuracy. This work proposes a two-stage collaborative filtering optimization mechanism that obtains a complete and diversified item list. The first stage of the model incorporates multiple interests to optimize neighbor selection. In addition to using conventional collaborative filtering to predict ratings by exploiting available ratings, the proposed model further considers the social relationships of the user. A novel ranking strategy is then used to rearrange the list of top-N items while maintaining accuracy by (1) rearranging the area controlled by the threshold and by (2) maximizing popularity while maintaining an acceptable reduction in accuracy. An extensive experimental evaluation performed in a real-world dataset confirmed that, for a given loss of accuracy, the proposed model achieves higher diversity compared to conventional approaches.

Download Full-text

The Exact Solution to the Two-Stage Group-Testing Problem

Technometrics ◽

10.1080/00401706.1978.10489706 ◽

1978 ◽

Vol 20 (4) ◽

pp. 497-500 ◽

Cited By ~ 15

Author(s):

S. M. Samuels

Keyword(s):

Exact Solution ◽

Group Testing ◽

Two Stage ◽

Testing Problem

Download Full-text

Optimal sample sizes for group testing in two-stage sampling

Seed Science Research ◽

10.1017/s096025851400035x ◽

2014 ◽

Vol 25 (01) ◽

pp. 12-28 ◽

Cited By ~ 1

Author(s):

Osval Antonio Montesinos-López ◽

Kent Eskridge ◽

Abelardo Montesinos-López ◽

José Crossa

Keyword(s):

Group Testing ◽

Sample Sizes ◽

Two Stage ◽

Optimal Sample

Download Full-text

A Two-Stage Exon Recognition Model Based on Synergetic Neural Network

Computational and Mathematical Methods in Medicine ◽

10.1155/2014/503132 ◽

2014 ◽

Vol 2014 ◽

pp. 1-7 ◽

Cited By ~ 4

Author(s):

Zhehuang Huang ◽

Yidong Chen

Keyword(s):

Neural Network ◽

Digital Signal ◽

Two Stage ◽

Recognition Model ◽

Artificial Fish Swarm Algorithm ◽

Proposed Model ◽

Artificial Fish Swarm ◽

Recognition Efficiency ◽

Processing Techniques ◽

Signal Processing Techniques

Exon recognition is a fundamental task in bioinformatics to identify the exons of DNA sequence. Currently, exon recognition algorithms based on digital signal processing techniques have been widely used. Unfortunately, these methods require many calculations, resulting in low recognition efficiency. In order to overcome this limitation, a two-stage exon recognition model is proposed and implemented in this paper. There are three main works. Firstly, we use synergetic neural network to rapidly determine initial exon intervals. Secondly, adaptive sliding window is used to accurately discriminate the final exon intervals. Finally, parameter optimization based on artificial fish swarm algorithm is used to determine different species thresholds and corresponding adjustment parameters of adaptive windows. Experimental results show that the proposed model has better performance for exon recognition and provides a practical solution and a promising future for other recognition tasks.

Download Full-text

A two-stage analogue model for real-time urban flood forecasting

10.5194/egusphere-egu21-15645 ◽

2021 ◽

Author(s):

Chris Onof ◽

Yuting Chen ◽

Li-Pen Wang ◽

Amy Jones ◽

Susana Ochoa Rodriguez

Keyword(s):

Real Time ◽

Flood Forecasting ◽

Image Feature ◽

Two Stage ◽

Analogue Model ◽

Flood Prediction ◽

Urban Flood ◽

Radar Images ◽

Proposed Model ◽

Flood Maps

In this work a two-stage (rainfall nowcasting + flood prediction) analogue model for real-time urban flood forecasting is presented. The proposed approach accounts for the complexities of urban rainfall nowcasting while avoiding the expensive computational requirements of real-time urban flood forecasting.The model has two consecutive stages:<ul><li>(1) Rainfall nowcasting: 0-6h lead time ensemble rainfall nowcasting is achieved by means of an analogue method, based on the assumption that similar climate condition will define similar patterns of temporal evolution of the rainfall. The framework uses the NORA analogue-based forecasting tool (Panziera et al., 2011), consisting of two layers. In the first layer, the 120 historical atmospheric (forcing) conditions most similar to the current atmospheric conditions are extracted, with the historical database consisting of ERA5 reanalysis data from the ECMWF and the current conditions derived from the US Global Forecasting System (GFS). In the second layer, twelve historical radar images most similar to the current one are extracted from amongst the historical radar images linked to the aforementioned 120 forcing analogues. Lastly, for each of the twelve analogues, the rainfall fields (at resolution of 1km/5min) observed after the present time are taken as one ensemble member. Note that principal component analysis (PCA) and uncorrelated multilinear PCA methods were tested for image feature extraction prior to applying the nearest neighbour technique for analogue selection.</li> <li>(2) Flood prediction: we predict flood extent using the high-resolution rainfall forecast from Stage 1, along with a database of pre-run flood maps at 1x1 km2 solution from 157 catalogued historical flood events. A deterministic flood prediction is obtained by using the averaged response from the twelve flood maps associated to the twelve ensemble rainfall nowcasts, where for each gridded area the median value is adopted (assuming flood maps are equiprobabilistic). A probabilistic flood prediction is obtained by generating a quantile-based flood map. Note that the flood maps were generated through rolling ball-based mapping of the flood volumes predicted at each node of the InfoWorks ICM sewer model of the pilot area.</li> </ul>The Minworth catchment in the UK (~400 km2) was used to demonstrate the proposed model. Cross&#8209;assessment was undertaken for each of 157 flooding events by leaving one event out from training in each iteration and using it for evaluation. With a focus on the spatial replication of flood/non-flood patterns, the predicted flood maps were converted to binary (flood/non-flood) maps. Quantitative assessment was undertaken by means of a contingency table. An average accuracy rate (i.e. proportion of correct predictions, out of all test events) of 71.4% was achieved, with individual accuracy rates ranging from 57.1% to 78.6%). Further testing is needed to confirm initial findings and flood mapping refinement will be pursued.The proposed model is fast, easy and relatively inexpensive to operate, making it suitable for direct use by local authorities who often lack the expertise on and/or capabilities for flood modelling and forecasting.References: Panziera et al. 2011. NORA&#8211;Nowcasting of Orographic Rainfall by means of Analogues. Quarterly Journal of the Royal Meteorological Society. 137, 2106-2123.

Download Full-text

Bounds on the efficiency of two-stage group testing

Proceedings. 1998 IEEE International Symposium on Information Theory (Cat. No.98CH36252) ◽

10.1109/isit.1998.708694 ◽

2002 ◽

Author(s):

T. Berger ◽

J.W. Mandell

Keyword(s):

Group Testing ◽

Two Stage

Download Full-text

Adaptive Procedures for the Two-Stage Group-Testing Problem Based on Prior Distributions and Costs

Technometrics ◽

10.1080/00401706.1990.10484726 ◽

1990 ◽

Vol 32 (4) ◽

pp. 397-405 ◽

Cited By ~ 4

Author(s):

Helmut Schneider ◽

Kwei Tang

Keyword(s):

Group Testing ◽

Two Stage ◽

Prior Distributions ◽

Testing Problem ◽

Adaptive Procedures

Download Full-text

Spatial regression methods to evaluate beekeeping production in the state of Rio de Janeiro

Arquivo Brasileiro de Medicina Veterinária e Zootecnia ◽

10.1590/s0102-09352013000200035 ◽

2013 ◽

Vol 65 (2) ◽

pp. 553-558

Author(s):

W.S. Tassinari ◽

M.C. Lorenzon ◽

E.L. Peixoto

Keyword(s):

Rio De Janeiro ◽

High Performance ◽

Spatial Regression ◽

The State ◽

Spatial Effects ◽

Regression Methods ◽

Honey Production ◽

Production Factors ◽

Proposed Model ◽

Wild Vegetation

Brazilian beekeeping has been developed from the africanization of the honeybees and its high performance launches Brazil as one of the world´s largest honey producer. The Southeastern region has an expressive position in this market (45%), but the state of Rio de Janeiro is the smallest producer, despite presenting large areas of wild vegetation for honey production. In order to analyze the honey productivity in the state of Rio de Janeiro, this research used classic and spatial regression approaches. The data used in this study comprised the responses regarding beekeeping from 1418 beekeepers distributed throughout 72 counties of this state. The best statistical fit was a semiparametric spatial model. The proposed model could be used to estimate the annual honey yield per hive in regions and to detect production factors more related to beekeeping. Honey productivity was associated with the number of hives, wild swarm collection and losses in the apiaries. This paper highlights that the beekeeping sector needs support and help to elucidate the problems plaguing beekeepers, and the inclusion of spatial effects in the regression models is a useful tool in geographical data.

Download Full-text