Intelligent Data Capture with Automated Tools

A course by Rocío Usero López
Internal training for URJC teaching and research staff

Description

This course offers a practical introduction to web scraping, geared towards teachers and researchers who want to learn how to extract information from web pages without programming. Throughout the course, participants will explore low- and no-code tools that simplify data collection and organization, such as Instant Data Scraper, Data Miner, and Octoparse. They will understand the fundamentals of web scraping, its applications in teaching and research, and the legal and ethical considerations involved in data extraction. Through practical exercises, these tools will be applied to gather information from news sites, academic repositories, and open data portals. By the end of the course, participants will be able to automate data collection processes and use the results for trend analysis, comparative studies, and informed decision-making in academia.

What you will learn

This course is designed to equip participants with the skills necessary to extract and process data from web pages using visual and low-code tools. Throughout the course, participants will learn the conceptual foundations, ethical considerations, and practical applications of web scraping in academic and research contexts.

During the course, we will work on:

  • Understanding what web scraping is and its main applications in teaching and research
  • Recognize the legal and ethical limitations of data mining on the web
  • Learn and use no-code and low-code tools such as Instant Data Scraper, Data Miner, and Octoparse
  • Design data extraction flows adapted to different types of web pages
  • Collect and organize information from sources such as news portals, academic repositories, or open data

Requirements

  • No prior knowledge is necessary, as this is an introductory course aimed at people interested in learning how to extract web data without programming code
  • This may be of particular interest to researchers whose research is based on the collection of massive amounts of data through websites

Faculty

Rocío Usero López

King Juan Carlos University

A researcher at Rey Juan Carlos University in the Business Studies area, specializing in digital transformation and agile methodologies. She belongs to the Department of Business Economics, Business Organization area. She has participated in digitization projects, automating business processes and creating chatbots.

Frequently Asked Questions

What type of audience might be interested in taking the course?

Researchers from King Juan Carlos University.

Automation of web data extraction.

Collection of massive amounts of web data.

You can download your course completion certificate free of charge once you have completed all the required course activities. The certificate will confirm your successful completion of the course and will include the total number of hours.

To enroll in this course, simply log in or create your account and then click on the Start.

Current versions of Chrome, Firefox, Safari, or Internet Explorer version 9 or higher.

Enrollment and participation in a URJCx course is free. There are absolutely no academic penalties for dropping out. You can enroll in the same and/or other courses (as long as they are still being offered) at a later time.

This course is designed to be self-paced. There's no need to start at a specific time, although a learning pace of one topic per week is recommended.

At the end of the course you will be assessed with a test on the basic concepts learned.

Courses that might interest you

Don't miss a thing

Subscribe to the newsletter

Receive information about new courses and news