Intermediate Regular Expressions in R

Level

Intermediate

Language

Access

Paid

Certificate

Paid

Manipulate text data, analyze it and more by mastering regular expressions and string distances in R.


Course Description

Analyzing data that comes in tables is fun. But what if the data we are interested in is not available as a neatly organized dataset but only as plain text? Do not despair: in this course, you’ll learn everything you need to know to write regular expressions that will help you find all the information you need for your analyses in just a blob of text. But not only that. Using the concept of string distances, you will learn to work even with text that contains typos or scanning errors, as you will be able to match such entries to their correct counterparts from other data sources (record linkage). As learning material, we will analyze real documents about box office figures in Swiss cinemas.

What You’ll Learn

Regular Expressions: Writing Custom Patterns

Regular expressions can be pretty intimidating at first because they contain so many special characters. In this chapter, you’ll learn to decipher these and write your own patterns to find exactly what you’re looking for.

Extracting Structured Data From Text

One task where regular expressions really shine is making sense of a blob of text. In this chapter, you’ll learn to extract information from messy data that doesn’t come in neatly arranged tables but in plain text.
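As a rough illustration of this kind of extraction, here is a minimal sketch in base R. The input lines, the film names, and the line layout are hypothetical and made up for this example; the course itself may use different data and different packages.

```r
# Hypothetical plain-text lines (illustrative only, not real box office data)
text <- c("Film A (2015): 1200 tickets",
          "Film B (1998): 45000 tickets")

# Pattern with three capture groups: title, four-digit year, ticket count
pattern <- "^(.+) \\(([0-9]{4})\\): ([0-9]+) tickets$"

# regexec() locates the full match and each group; regmatches() extracts them
matches <- regmatches(text, regexec(pattern, text))

# Each list element holds the full match followed by the three groups
parts <- do.call(rbind, matches)

# Assemble the captured pieces into a tidy table
result <- data.frame(
  title   = parts[, 2],
  year    = as.integer(parts[, 3]),
  tickets = as.integer(parts[, 4]),
  stringsAsFactors = FALSE
)
print(result)
```

The parentheses in the pattern define capture groups, while the escaped ones (\\( and \\)) match the literal brackets around the year.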

Creating Strings with Data

In this chapter, we will move slightly away from regular expressions and focus on string manipulation by creating strings from other data structures like vectors or lists.
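A minimal sketch of building strings from vectors, using only base R functions (sprintf() and paste()). The vectors below are hypothetical example data, and the choice of base R is an assumption; the course may rely on other tools for this.

```r
# Hypothetical example vectors (illustrative only)
titles  <- c("Film A", "Film B")
tickets <- c(1200L, 45000L)

# sprintf() is vectorized: it builds one sentence per pair of elements
sentences <- sprintf("%s sold %d tickets.", titles, tickets)

# paste() with collapse joins all elements of a vector into a single string
summary_line <- paste(titles, collapse = ", ")

print(sentences)
print(summary_line)
```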

Similarities Between Strings

In the last chapter, we will shift gears away from regular expressions to understanding string distances. By calculating the differences between strings, we can match those that are similar. This will help us find duplicates even when they contain small errors like typos. This is an important part of record linkage, where we combine datasets from multiple sources.
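The matching idea can be sketched with base R's adist(), which computes Levenshtein edit distances. This is only one possible distance measure, chosen here as an assumption; the course may use a different package or metric. The titles below are hypothetical.

```r
# Hypothetical clean reference titles and messy entries with typos
reference <- c("Titanic", "Avatar", "Inception")
messy     <- c("Titanik", "Avatar", "Incepton")

# adist() returns a matrix of edit distances:
# one row per messy entry, one column per reference title
distances <- adist(messy, reference)

# For each messy entry, pick the reference title with the smallest distance
best_match <- reference[apply(distances, 1, which.min)]

linked <- data.frame(
  messy    = messy,
  matched  = best_match,
  distance = apply(distances, 1, min),
  stringsAsFactors = FALSE
)
print(linked)
```

In practice, a maximum allowed distance would usually be set as well, so that entries with no plausible counterpart are not forced onto the nearest title.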

