Mining big data to extract patterns and predict real-life outcomes.
Publication type: Journal Article
Publication date: 2016-12-05
scimago Q1
wos Q1
SJR: 4.925
CiteScore: 18.0
Impact factor: 7.8
ISSN: 1082989X, 19391463
PubMed ID:
27918179
Psychology (miscellaneous)
Abstract
This article aims to introduce the reader to essential tools that can be used to obtain insights and build predictive models using large data sets. Recent user proliferation in the digital environment has led to the emergence of large samples containing a wealth of traces of human behaviors, communication, and social interactions. Such samples offer the opportunity to greatly improve our understanding of individuals, groups, and societies, but their analysis presents unique methodological challenges. In this tutorial, we discuss potential sources of such data and explain how to efficiently store them. Then, we introduce two methods that are often employed to extract patterns and reduce the dimensionality of large data sets: singular value decomposition and latent Dirichlet allocation. Finally, we demonstrate how to use dimensions or clusters extracted from data to build predictive models in a cross-validated way. The text is accompanied by examples of R code and a sample data set, allowing the reader to practice the methods discussed here. A companion website (http://dataminingtutorial.com) provides additional learning resources. (PsycINFO Database Record
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
1
2
3
4
5
6
|
|
|
Frontiers in Psychology
6 publications, 4.8%
|
|
|
Small Business Economics
4 publications, 3.2%
|
|
|
Annual Review of Organizational Psychology and Organizational Behavior
3 publications, 2.4%
|
|
|
SSRN Electronic Journal
3 publications, 2.4%
|
|
|
Zeitschrift fur Psychologie / Journal of Psychology
2 publications, 1.6%
|
|
|
Personality and Social Psychology Bulletin
2 publications, 1.6%
|
|
|
Behavior Research Methods
2 publications, 1.6%
|
|
|
PLoS ONE
2 publications, 1.6%
|
|
|
Environmental Science and Policy
2 publications, 1.6%
|
|
|
Personality and Individual Differences
2 publications, 1.6%
|
|
|
Journal of Organizational Behavior
2 publications, 1.6%
|
|
|
Studies in Neuroscience, Psychology and Behavioral Economics
2 publications, 1.6%
|
|
|
Lecture Notes in Networks and Systems
2 publications, 1.6%
|
|
|
Psychological Methods
2 publications, 1.6%
|
|
|
Online Information Review
1 publication, 0.8%
|
|
|
Zeitschrift fur Kinder- und Jugendpsychiatrie und Psychotherapie
1 publication, 0.8%
|
|
|
Psychologische Rundschau
1 publication, 0.8%
|
|
|
Journal of Medical Internet Research
1 publication, 0.8%
|
|
|
Cyberpsychology, Behavior, and Social Networking
1 publication, 0.8%
|
|
|
Research in Education
1 publication, 0.8%
|
|
|
Social Psychological and Personality Science
1 publication, 0.8%
|
|
|
Personality and Social Psychology Review
1 publication, 0.8%
|
|
|
Journal of Marketing
1 publication, 0.8%
|
|
|
Statistical Modelling
1 publication, 0.8%
|
|
|
International Journal of Behavioral Development
1 publication, 0.8%
|
|
|
Social Media and Society
1 publication, 0.8%
|
|
|
Tourism Economics
1 publication, 0.8%
|
|
|
Journal of Intelligence
1 publication, 0.8%
|
|
|
Big Data and Cognitive Computing
1 publication, 0.8%
|
|
|
1
2
3
4
5
6
|
Publishers
|
5
10
15
20
25
30
|
|
|
Springer Nature
28 publications, 22.4%
|
|
|
Elsevier
21 publications, 16.8%
|
|
|
SAGE
14 publications, 11.2%
|
|
|
Wiley
13 publications, 10.4%
|
|
|
Taylor & Francis
7 publications, 5.6%
|
|
|
Frontiers Media S.A.
6 publications, 4.8%
|
|
|
Annual Reviews
3 publications, 2.4%
|
|
|
Cambridge University Press
3 publications, 2.4%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
3 publications, 2.4%
|
|
|
Emerald
2 publications, 1.6%
|
|
|
Johann Ambrosius Barth
2 publications, 1.6%
|
|
|
Hogrefe Publishing Group
2 publications, 1.6%
|
|
|
JMIR Publications
2 publications, 1.6%
|
|
|
MDPI
2 publications, 1.6%
|
|
|
Public Library of Science (PLoS)
2 publications, 1.6%
|
|
|
IGI Global
2 publications, 1.6%
|
|
|
American Psychological Association (APA)
2 publications, 1.6%
|
|
|
Mary Ann Liebert
1 publication, 0.8%
|
|
|
University of Illinois Press
1 publication, 0.8%
|
|
|
CAIRN
1 publication, 0.8%
|
|
|
Social Science Electronic Publishing
1 publication, 0.8%
|
|
|
American Accounting Association
1 publication, 0.8%
|
|
|
Uniwersytet Jagiellonski - Wydawnictwo Uniwersytetu Jagiellonskiego
1 publication, 0.8%
|
|
|
Uniwersytet Lodzki (University of Lodz)
1 publication, 0.8%
|
|
|
5
10
15
20
25
30
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
125
Total citations:
125
Citations from 2025:
9
(7.2%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Kosinski M. et al. Mining big data to extract patterns and predict real-life outcomes. // Psychological Methods. 2016. Vol. 21. No. 4. pp. 493-506.
GOST all authors (up to 50)
Copy
Kosinski M., Wang Y., Lakkaraju H., Leskovec J. Mining big data to extract patterns and predict real-life outcomes. // Psychological Methods. 2016. Vol. 21. No. 4. pp. 493-506.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1037/met0000105
UR - http://doi.apa.org/getdoi.cfm?doi=10.1037/met0000105
TI - Mining big data to extract patterns and predict real-life outcomes.
T2 - Psychological Methods
AU - Kosinski, Michal
AU - Wang, YiLun
AU - Lakkaraju, Himabindu
AU - Leskovec, Jure
PY - 2016
DA - 2016/12/05
PB - American Psychological Association (APA)
SP - 493-506
IS - 4
VL - 21
PMID - 27918179
SN - 1082-989X
SN - 1939-1463
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2016_Kosinski,
author = {Michal Kosinski and YiLun Wang and Himabindu Lakkaraju and Jure Leskovec},
title = {Mining big data to extract patterns and predict real-life outcomes.},
journal = {Psychological Methods},
year = {2016},
volume = {21},
publisher = {American Psychological Association (APA)},
month = {dec},
url = {http://doi.apa.org/getdoi.cfm?doi=10.1037/met0000105},
number = {4},
pages = {493--506},
doi = {10.1037/met0000105}
}
Cite this
MLA
Copy
Kosinski, Michal, et al. “Mining big data to extract patterns and predict real-life outcomes..” Psychological Methods, vol. 21, no. 4, Dec. 2016, pp. 493-506. http://doi.apa.org/getdoi.cfm?doi=10.1037/met0000105.