A Multi-truth Discovery Algorithm based on Statement Value Grouping and Data Source Information Richness

Authors

  • Dongjun Gao
  • Zhiyong Zhang

DOI:

https://doi.org/10.54691/eghw4742

Keywords:

Multi-truth Discovery; Value Similarity; Source Reliability.

Abstract

As the volume of data continues to grow, it is common for data from the same source to contain multiple domains. Combining domain segmentation can enhance the effectiveness of data fusion. This paper presents a multi-truth discovery algorithm that utilises statement value grouping and domain information richness. The data is first grouped based on their similarity, and the resulting groups replace the original data. Then, the reliability of each data source is calculated by domain. The truth value and reliability of each source are iteratively calculated until the end condition is met. Finally, the appropriate value is selected as the final result from the obtained dataset. Experiments were conducted on real datasets to demonstrate the algorithm's effectiveness.

Downloads

Download data is not yet available.

References

Sun P, Wang Z, Wu L, et al. Towards Personalized Privacy-Preserving Incentive for Truth Discovery in Mobile Crowdsensing Systems, IEEE Transactions on Mobile Computing, Vol. 21 (2020) No.1, p.352-365.

Tang J, Fu S, Liu X, et al. Achieving Privacy-Preserving and Lightweight Truth Discovery in Mobile Crowdsensing, IEEE Transactions on Knowledge and Data Engineering, Vol. 34 (2021) No.11, p.5140-5153.

Li Y, Sun H, Wang W H. Towards Fair Truth Discovery from Biased Crowdsourced Answers, Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2020).p.599-607.

Yin X, Han J, Yu P S. Truth Discovery with Multiple Conflicting Information Providers on the Web, IEEE Transactions on Knowledge & Data Engineering, Vol. 20 (2008) No.6, p.796-808.

Lyu S S, Ouyang W T, Wang Y Q, et al. Truth Discovery by Claim and Source Embedding, IEEE Transactions on Knowledge and Data Engineering, Vol. 33 (2021) No.3, p.1264-1275.

Ye C, Li Q, Zhang H, et al. Autorepair: An Automatic Repairing Approach over Multi-source Data, Knowledge and Information Systems, Vol. 61 (2019), p.227-257.

Chang C, CAO J J, Feng Q, et al. Truth Discovery of Multi-source Text Data, IEICE Transactions on Information and Systems, Vol. 102 (2019) No.11, p.2249-2252.

Fang X S, Sheng Q Z, Wang X Z, et al. From Appearance to Essence: Comparing Truth Discovery Methods Without Using Ground Truth, ACM Transactions on Intelligent Systems and Technology, Vol. 11 (2020) No.6, p.1-24.

Xu Y, Cao J J, Weng N F, et al. An Adaptive Truth Discovery Model Consisting of Multiple Truth Part and Single Truth Part, 2021 International Conference on Cyber-Physical Social Intelligence (2021). p.1-5.

Zhao B, Rubinstein B I P, Gemmell J, et al. A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration, Proceedings of the VLDB Endowment, Vol. 5 (2012) No.6, p.550-561.

Fang X S, Sheng Q Z, Wang X Z, et al. Smartvote: A Full-Fledged Graph-Based Model for Multi-valued Truth Discovery, World Wide Web, Vol. 22 (2019) No.4, p.1855-1885.

Lin X, Chen L. Domain-aware multi-truth discovery from conflicting sources, Proceedings of the VLDB Endowment, Vol. 11 (2018) No.5, p.635-647.

Li Q, Li Y, Gao J, et al. A Confidence-Aware Approach for Truth Discovery on Long-Tail Data, Proceedings of the VLDB Endowment, Vol. 8 (2014) No.4, p.425-436.

Downloads

Published

2024-04-30

Issue

Section

Articles

How to Cite

Gao, D., & Zhang, Z. (2024). A Multi-truth Discovery Algorithm based on Statement Value Grouping and Data Source Information Richness. Frontiers in Science and Engineering, 4(4), 217-227. https://doi.org/10.54691/eghw4742