Paper
A web crawler-based method for collecting information on investment promotion enterprises
14 June 2023
Jingyao Sun, Shengnan Zhang, Mengli Dai
Proceedings Volume 12708, 3rd International Conference on Internet of Things and Smart City (IoTSC 2023); 127081U (2023) https://doi.org/10.1117/12.2683931
Event: 3rd International Conference on Internet of Things and Smart City (IoTSC 2023), 2023, Chongqing, China
Abstract
Data acquisition is a prerequisite for big data analytics. However, as data become more diverse and more time-sensitive, collecting them becomes more complex. This paper takes enterprise data on a big data investment platform as its research object and designs two data collection models matched to the static and dynamic characteristics of these data: static data collection based on incremental crawlers, and dynamic data collection based on query topic crawlers. Experiments test the effectiveness of both web crawler methods and show that they can collect static and dynamic investment data comprehensively and accurately. The study thus provides an effective data collection scheme that helps improve the accuracy and reliability of big data analysis.
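The abstract does not describe how the two crawlers are implemented, so the following is only a minimal sketch of the two general ideas, not the authors' method. It assumes that an incremental crawler can detect changes by comparing content hashes between runs, and that a query topic crawler can be approximated by keyword matching on page text. The names `fingerprint`, `incremental_crawl`, `topic_crawl`, and the injected `fetch` callable are all hypothetical and chosen for illustration.

```python
import hashlib
from typing import Callable, Dict, Iterable, List


def fingerprint(content: str) -> str:
    """Return a stable hash of a page body, used to detect changes."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


def incremental_crawl(
    urls: Iterable[str],
    fetch: Callable[[str], str],
    seen: Dict[str, str],
) -> List[str]:
    """Fetch each URL and return only those that are new or have changed.

    `seen` maps URL -> last stored fingerprint and is updated in place,
    so a second run over unchanged pages returns an empty list.
    """
    changed = []
    for url in urls:
        digest = fingerprint(fetch(url))
        if seen.get(url) != digest:
            seen[url] = digest
            changed.append(url)
    return changed


def topic_crawl(
    urls: Iterable[str],
    fetch: Callable[[str], str],
    query_terms: Iterable[str],
) -> List[str]:
    """Return URLs whose content mentions at least one query term.

    Matching is a case-insensitive substring test (an assumption; a real
    topic crawler would likely use a stronger relevance measure).
    """
    terms = [t.lower() for t in query_terms]
    hits = []
    for url in urls:
        body = fetch(url).lower()
        if any(term in body for term in terms):
            hits.append(url)
    return hits


# Usage with an in-memory stand-in for the network (hypothetical data).
pages = {
    "a": "Acme Ltd, registered 2001",
    "b": "Beta Corp files new patent",
}
seen: Dict[str, str] = {}
first_run = incremental_crawl(pages, pages.__getitem__, seen)
second_run = incremental_crawl(pages, pages.__getitem__, seen)
```

Passing `fetch` as a parameter keeps the sketch independent of any particular HTTP library; in practice it could wrap whatever client the platform uses.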
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jingyao Sun, Shengnan Zhang, and Mengli Dai "A web crawler-based method for collecting information on investment promotion enterprises", Proc. SPIE 12708, 3rd International Conference on Internet of Things and Smart City (IoTSC 2023), 127081U (14 June 2023); https://doi.org/10.1117/12.2683931
KEYWORDS
Data modeling
Databases
Industry
Patents
Data acquisition
Windows
Data processing