Big Data Methods

2016 ◽  
Vol 21 (3) ◽  
pp. 525-547 ◽  
Author(s):  
Scott Tonidandel ◽  
Eden B. King ◽  
Jose M. Cortina

Advances in data science, such as data mining, data visualization, and machine learning, are extremely well-suited to address numerous questions in the organizational sciences given the explosion of available data. Despite these opportunities, few scholars in our field have discussed the specific ways in which the lens of our science should be brought to bear on the topic of big data and big data's reciprocal impact on our science. The purpose of this paper is to provide an overview of the big data phenomenon and its potential for impacting organizational science in both positive and negative ways. We identify the biggest opportunities afforded by big data along with the biggest obstacles, and we discuss specifically how we think our methods will be most impacted by the data analytics movement. We also provide a list of resources to help interested readers incorporate big data methods into their existing research. Our hope is that we stimulate interest in big data, motivate future research using big data sources, and encourage the application of associated data science techniques more broadly in the organizational sciences.

Author(s):  
Natanael Vitor Sobral ◽  
Gillian Leandro de Queiroga Lima ◽  
Ana Sara Pereira de Melo Sobral

Objective: to conduct a bibliometric analysis of the applications of data science in hospital organizations. Method: a search of the Web of Science database checked for terms related to data science, such as “big data”, “data analytics”, “business intelligence”, “data mining”, “data warehouse”, “text mining” and “data science”, in connection with hospitals. Data analysis was based on the social network analysis technique. The period considered was 2015 to 2019. Results: “machine learning” and “electronic health records” emerge as relevant topics. The strongest interactions reflect the inclination of medical informatics toward topics related to decision making, hospital information systems and intensive care units. As for scientific fields, there is the expected predominance of the health area and of domains belonging to or bordering on technology. In addition, the wide variety of areas found points to the multidisciplinary nature of the subject, including a significant contribution from Information Science (IS). Regarding the geography of knowledge, a reasonable degree of decentralization is observed, with representative output in North America, Europe and Asia. As for publication venues, Studies in Health Technology and Informatics, which comprises a series of publications, stands out. The two most representative journals on the list belong, respectively, to the Springer Nature and Elsevier groups, major players in the scientific publishing market. Conclusions: finally, the study highlights the multidisciplinarity surrounding the subject and the relevance of technology to the progress of hospital organizations.


2022 ◽  
pp. 1477-1503
Author(s):  
Ali Al Mazari

HIV/AIDS big data analytics has evolved as a potential initiative connecting three major scientific disciplines: (1) the emergence and evolution of HIV biology; (2) the complex clinical and medical problems and practices associated with the infections and diseases; and (3) the computational methods for mining HIV/AIDS biological, medical, and clinical big data. This chapter reviews the computational and data mining perspectives on HIV/AIDS in the big data era. The chapter focuses on the research opportunities in this domain, identifies the challenges facing the development of big data analytics in the HIV/AIDS domain, and then highlights the future research directions of big data in the healthcare sector.


2020 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Roberto Salazar-Reyna ◽  
Fernando Gonzalez-Aleu ◽  
Edgar M.A. Granda-Gutierrez ◽  
Jenny Diaz-Ramirez ◽  
Jose Arturo Garza-Reyes ◽  
...  

Purpose: The objective of this paper is to assess and synthesize the published literature related to the application of data analytics, big data, data mining and machine learning to healthcare engineering systems. Design/methodology/approach: A systematic literature review (SLR) was conducted to obtain the most relevant papers related to the research study from three different platforms: EBSCOhost, ProQuest and Scopus. The literature was assessed and synthesized, conducting analysis associated with the publications, authors and content. Findings: From the SLR, 576 publications were identified and analyzed. The research area seems to show the characteristics of a growing field with new research areas evolving and applications being explored. In addition, the main authors and collaboration groups publishing in this research area were identified through a social network analysis. This could lead new and current authors to identify researchers with common interests in the field. Research limitations/implications: The use of the SLR methodology does not guarantee that all relevant publications related to the research are covered and analyzed. However, the authors' previous knowledge and the nature of the publications were used to select different platforms. Originality/value: To the best of the authors' knowledge, this paper represents the most comprehensive literature-based study on the fields of data analytics, big data, data mining and machine learning applied to healthcare engineering systems.


High volumes and varieties of data accumulate every day from healthcare and related fields. These big data sources, if managed and analysed properly, will provide vital knowledge. Data mining and data analytics have been playing an important role in extracting useful information from healthcare and related data sources. The knowledge extracted from these data sources guides patients and healthcare personnel towards improved health conditions. Analytical techniques from statistics and functionalities from data mining and machine learning have already proved their capability with significant contributions to the healthcare industry. The dominant functionality of data mining is classification, which has long been used in mining healthcare data. Though classification is a good learning technique, it may not provide a causal model, which would be a more reliable basis for decision making, particularly in the medical field. The present models for causality have limitations in terms of scalability and reliability. The present study examines causal models for causal relationship mining. It concludes with some proposals for causal relationship discovery that are efficient, reliable and scalable. The proposed model makes use of some qualities of decision trees along with statistical tests and analytics. It is proposed to build the learning models on healthcare big data sources.


Author(s):  
Cerene Mariam Abraham ◽  
Mannathazhathu Sudheep Elayidom ◽  
Thankappan Santhanakrishnan

Background: Machine learning is one of the most popular research areas today. It relates closely to the field of data mining, which extracts information and trends from large datasets. Aims: The objective of this paper is to (a) illustrate big data analytics for the Indian derivative market and (b) identify trends in the data. Methods: Based on input from experts in the equity domain, the data are verified statistically using data mining techniques. Specifically, ten years of daily derivative data are used for training and testing purposes. The methods adopted for this research work include model generation using ARIMA and the Hadoop framework, which comprises mapping and reducing, for big data analysis. Results: The results of this work are the observation of a trend that indicates the rise and fall of price in derivatives, the generation of a time-series similarity graph and the plotting of the frequency of temporal data. Conclusion: Big data analytics is an underexplored topic in the Indian derivative market and the results from this paper can be used by investors to earn both short-term and long-term benefits.


2021 ◽  
Vol 14 (5) ◽  
pp. 1358-1359
Author(s):  
Vangipuram Radhakrishna ◽  
Gunupudi Rajesh Kumar ◽  
Gali Suresh Reddy ◽  
Dammavalam Srinivasa Rao


2020 ◽  
Vol 9 (12) ◽  
pp. 752
Author(s):  
Anna Kovacs-Györi ◽  
Alina Ristea ◽  
Clemens Havas ◽  
Michael Mehaffy ◽  
Hartwig H. Hochmair ◽  
...  

Urban systems involve a multitude of closely intertwined components, which are more measurable than before due to new sensors, data collection, and spatio-temporal analysis methods. Turning these data into knowledge to facilitate planning efforts in addressing current challenges of urban complex systems requires advanced interdisciplinary analysis methods, such as urban informatics or urban data science. Yet, by applying a purely data-driven approach, it is too easy to get lost in the ‘forest’ of data, and to miss the ‘trees’ of successful, livable cities that are the ultimate aim of urban planning. This paper assesses how geospatial data, and urban analysis, using a mixed methods approach, can help to better understand urban dynamics and human behavior, and how it can assist planning efforts to improve livability. Based on reviewing state-of-the-art research the paper goes one step further and also addresses the potential as well as limitations of new data sources in urban analytics to get a better overview of the whole ‘forest’ of these new data sources and analysis methods. The main discussion revolves around the reliability of using big data from social media platforms or sensors, and how information can be extracted from massive amounts of data through novel analysis methods, such as machine learning, for better-informed decision making aiming at urban livability improvement.


Author(s):  
Neeti Sangwan ◽  
Vishal Bhatnagar

In Big Data analysis, the application of machine learning has proven to be revolutionary. A systematic review of the literature shows that research has been carried out in the domain of big data analytics, particularly text analytics, with the inclusion of machine learning approaches. This extensive survey covers the different ways of combining machine learning approaches with text and the issues that arise in doing so. During the course of the survey, various publications on the combined application of machine learning and text analytics were searched and studied. A classification framework is proposed to capture the contribution of machine learning to text analytics. The framework represents the various application areas to motivate researchers toward future research on the application of these two emerging technologies.

