Javascript must be enabled to continue!
Data Mining applied on Web Robots Detection: A Systematic Mapping
View through CrossRef
Browsing on Internet is part of the world population’s daily routine. The number of web pages is increasing and so is the amount of published content (news, tutorials, images, videos) provided by them. Search engines use web robots to index web contents and to offer better results to their users. However, web robots have also been used for exploiting vulnerabilities in web pages. Thus, monitoring and detecting web robots’ accesses is important in order to keep the web server as safe as possible. Data Mining methods have been applied to web server logs (used as data source) in order to detect web robots. Then, the main objective of this work was to observe evidences of definition or use of web robots detection by analyzing web server-side logs using Data Mining methods. Thus, we conducted a systematic Literature mapping, analyzing papers published between 2013 and 2020. In the systematic mapping, we analyzed 34 studies and they allowed us to better understand the area of web robots detection, mapping what is being done, the data used to perform web robots detection, the tools, and algorithms used in the Literature. From those studies, we extracted 33 machine learning algorithms, 64 features, and 13 tools. This study is helpful for researchers to find machine learning algorithms, features, and tools to detect web robots by analyzing web server logs.
Title: Data Mining applied on Web Robots Detection: A Systematic Mapping
Description:
Browsing on Internet is part of the world population’s daily routine.
The number of web pages is increasing and so is the amount of published content (news, tutorials, images, videos) provided by them.
Search engines use web robots to index web contents and to offer better results to their users.
However, web robots have also been used for exploiting vulnerabilities in web pages.
Thus, monitoring and detecting web robots’ accesses is important in order to keep the web server as safe as possible.
Data Mining methods have been applied to web server logs (used as data source) in order to detect web robots.
Then, the main objective of this work was to observe evidences of definition or use of web robots detection by analyzing web server-side logs using Data Mining methods.
Thus, we conducted a systematic Literature mapping, analyzing papers published between 2013 and 2020.
In the systematic mapping, we analyzed 34 studies and they allowed us to better understand the area of web robots detection, mapping what is being done, the data used to perform web robots detection, the tools, and algorithms used in the Literature.
From those studies, we extracted 33 machine learning algorithms, 64 features, and 13 tools.
This study is helpful for researchers to find machine learning algorithms, features, and tools to detect web robots by analyzing web server logs.
Related Results
Research Status and Development Trend of Multi-arm Collaborative Robots
Research Status and Development Trend of Multi-arm Collaborative Robots
Industrial robots are mainly used in metal forming, automotive, and electrical and electronics
industries. After decades of unremitting efforts, industrial robots have achieved gre...
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
This review summarizes the evidence from six randomized controlled trials that judged the effectiveness of systematic review summaries on policymakers' decision making, or the most...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract
The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Parallel robots with unconventional joints to achieve under-actuation and reconfigurability
Parallel robots with unconventional joints to achieve under-actuation and reconfigurability
The aim of the thesis is to define, analyze, and verify through simulations and practical implementations, parallel robots with unconventional joints that allow them to be under-ac...
Agricultural Robots for Harvesting and Planting
Agricultural Robots for Harvesting and Planting
The agricultural sector is at the forefront of technological innovation, seeking sustainable solutions to address the increasing demand for food production in the face of populatio...
Eyes on Air
Eyes on Air
Abstract
We at ADNOC Logistics & Services have identified the need for a Fully Integrated Inspection and Monitoring Solution to meet our operational, safety and ...
Optimisation of potash mining technology for cell and pillar mining method
Optimisation of potash mining technology for cell and pillar mining method
The diverse demand for inorganic fertilizers has predetermined the intensification of potash mining, which is a raw material for their production. In this regard, it has become nec...

