Issue |
SHS Web Conf.
Volume 140, 2022
2022 International Conference on Information Technology in Education and Management Engineering (ITEME2022)
|
|
---|---|---|
Article Number | 01031 | |
Number of page(s) | 8 | |
DOI | https://doi.org/10.1051/shsconf/202214001031 | |
Published online | 25 May 2022 |
Research on data outlier detection method based on sample parameter selection LOF
1
School of Computer Science, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
2
School of Electro-Mechanical Engineering, Xidian University, Xi’an 710126, China
* Corresponding author: Yinlei_w@163.com
The LOF data anomaly detection method has some defects, such as the value of k has great influence on the accuracy of detection results, and the selection of k value usually adopts trial method, which consumes a lot of calculation time. Therefore, this paper proposes an anomaly detection method for LOF data based on sample parameter selection, Tagged according to the sample data set point of normal and abnormal point, the adaptive selection of k value and outlier detection, so as to improve the accuracy of data outlier detection and calculation speed, and through the example of meteorological data outliers detection showed that LOF abnormal data points based on sample parameter selection method in the detection accuracy and reliability are improved significantly.
© The Authors, published by EDP Sciences, 2022
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.