ACM International Conference Proceeding Series, pages 6-11
Bayesian Networks-based personal data synthesis
Publication type: Proceedings Article
Publication date: 2020-09-14
—
Abstract
Often, confidentiality problems and a lack of original data, make it challenging to analyze user data carefully. In such situations, synthetic data can be used that is more suitable for testing and training marketing strategies, personalized assistants, or behavior analysis systems than the original data. In this paper, the approach for generating synthetic social media profiles data based on Bayesian networks was analyzed. The personal data synthesis problem was considered as the inference of a joint probability distribution from the oriented probabilistic models like Bayesian networks. The quality of this approach in generating VKontakte (VK is the Russian analog of Facebook) social network data was demonstrated and assessed. The Bayesian network approach has shown itself well in the tasks of deriving joint and marginal data distributions, which has led to the production of high-quality synthetic personal data.
Citations by journals
1
|
|
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering
|
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering
1 publication, 50%
|
Data Science for Transportation
|
Data Science for Transportation
1 publication, 50%
|
1
|
Citations by publishers
1
2
|
|
Springer Nature
|
Springer Nature
2 publications, 100%
|
1
2
|
- We do not take into account publications that without a DOI.
- Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
- Statistics recalculated weekly.
{"yearsCitations":{"type":"bar","data":{"show":true,"labels":[2022,2023],"ids":[0,0],"codes":[0,0],"imageUrls":["",""],"datasets":[{"label":"Citations number","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":["50","50"],"barThickness":null}]},"options":{"indexAxis":"x","maintainAspectRatio":true,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":1,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Citations per year","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}},"journals":{"type":"bar","data":{"show":true,"labels":["Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering","Data Science for Transportation"],"ids":[6646,35580],"codes":[0,0],"imageUrls":["\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp","\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp"],"datasets":[{"label":"","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":[50,50],"barThickness":13}]},"options":{"indexAxis":"y","maintainAspectRatio":false,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":null,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Journals","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}},"publishers":{"type":"bar","data":{"show":true,"labels":["Springer Nature"],"ids":[8],"codes":[0],"imageUrls":["\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp"],"datasets":[{"label":"","data":[2],"backgroundColor":["#3B82F6"],"percentage":[100],"barThickness":13}]},"options":{"indexAxis":"y","maintainAspectRatio":false,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":null,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Publishers","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}}}
Metrics
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Deeva I. et al. Bayesian Networks-based personal data synthesis // ACM International Conference Proceeding Series. 2020. pp. 6-11.
GOST all authors (up to 50)
Copy
Deeva I., Andriushchenko P. D., Kalyuzhnaya A. V., Boukhanovsky A. V. Bayesian Networks-based personal data synthesis // ACM International Conference Proceeding Series. 2020. pp. 6-11.
Cite this
RIS
Copy
TY - CPAPER
DO - 10.1145/3411170.3411243
UR - https://doi.org/10.1145%2F3411170.3411243
TI - Bayesian Networks-based personal data synthesis
T2 - ACM International Conference Proceeding Series
AU - Deeva, Irina
AU - Andriushchenko, Petr D
AU - Kalyuzhnaya, Anna V
AU - Boukhanovsky, Alexander V
PY - 2020
DA - 2020/09/14 00:00:00
SP - 6-11
ER -
Cite this
BibTex
Copy
@inproceedings{2020_Deeva,
author = {Irina Deeva and Petr D Andriushchenko and Anna V Kalyuzhnaya and Alexander V Boukhanovsky},
title = {Bayesian Networks-based personal data synthesis},
year = {2020},
pages = {6--11},
month = {sep}
}
Profiles