The WordLens-Project
  • The WordLens-Project
  • Course Overview
  • Part 1: Transform and Visualize Data
    • 1 Working Environment
    • 2 R and the Tidyverse
    • 3 Data Loading
      • Tabular Data
      • Tidy Data
      • Exploring New Data
    • 4 Data Transformation
      • Select Columns
      • Filter Rows
      • Sort Rows
      • Add Or Change Columns
        • Calculate New Columns
        • Change Data Types
        • Rename Columns
        • Joining Data Sets
      • Summarize Rows
    • 5 Data Visualization
      • Pleas for Visualization
      • Fast and Simple Plots
      • Grammar of Graphics
  • Part 2: Rule-Based NLP
    • 6 Unstructured Data
    • 7 Searching Text
    • 8 Tokenizing Text
      • Filter or Sample Data
      • Clean and Normalize Text
      • Split Text Into Tokens
      • Removing Stop Words
      • Enrich Tokens
    • 9 Topic Classification
      • Deductive
      • Inductive
    • 10 Sentiment Analysis
    • 11 Text Classification
    • 12 Word Pairs and N-Grams
  • Part 3: NLP with Machine Learning
    • 13 Text Embeddings
    • 14 Part-Of-Speech
    • 15 Named Entities
    • 16 Syntactic Dependency
    • 17 Similarity
    • 18 Sentiment
    • 19 Text Classification
    • 20 Transformers
    • 21 Training a Model
    • 22 Large Language Models
  • Appendix
  • Resources
Powered by GitBook
On this page
  • Code Examples and Data
  • Exercises
  • Part 1: Transform and Visualize Data
  • Part 2: Rule-Based NLP
  • Part 3: NLP with Machine Learning
  • Books
  • Cheat Sheets
  • R & Python Frameworks and Libraries
  • Helpful Tools

Resources

All resources from this course bundled on a single page.

Previous22 Large Language Models

Last updated 2 years ago

Code Examples and Data

Exercises

Part 1: Transform and Visualize Data

Part 2: Rule-Based NLP

Part 3: NLP with Machine Learning

  • ...

Books

Cheat Sheets

R & Python Frameworks and Libraries

Helpful Tools

All code examples for this course on GitHub
Link to folder with data on GitHub
Practice Exercise: Explore Campusbier Sales Orders
Practice Exercise: Data Transformation with Campusbier Sales Orders
Practice Exercise: Explore and Transform REWE Products
Practice Exercise: Data Transformation with Tweets
Practice Exercise: Data Visualization with Tweets
Exercise 1: Data Transformation and Visualization
Exercise 2: Search and Extract Text Data
Exercise 3: Rule-Based Topic Classification of Newspaper Articles
Free and work-in-progress online version of "R For Data Science" by Hadley Wickham and Garret Grolemund
Free online version of the book "Hands-On Programming with R"
Free online version of the book "Text Mining with R" by Julia Silge and David Robinson
Free online version of the book "Fundamentals of Data Visualization by Claus Wilke
Posit cheat sheets on R and Python
My personal collection of links around R
Website of the Tidyverse
GitHub repository of the tidytext package
Website of the Python NLP library spaCy
ChatGPT
DeepL
Stackoverflow