-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.qmd
65 lines (39 loc) · 4.05 KB
/
index.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
---
title: "Apra PD 2023 Workshop"
format: html
---
## An Introduction to Text Mining
by Matt Farrow
------------------------------------------------------------------------
🗓️August 28, 2023\
⏰ 1:00 - 4:00 pm\
🏨 JW Marriott Indianapolis: White River AB\
------------------------------------------------------------------------
# Pre-Work
## Using Posit Cloud
The materials for this workshop are available in a cloud environment that can be run with no downloads or setup. All you need to is go to [posit.cloud](https://posit.cloud) and sign up for a free account. This page will be updated on Monday, August 28 prior to the workshop with the direct link to the workshop space.
{{< qrcode https://posit.cloud/ >}}
## Workshop Link
Workshop attendees may click [this link](https://posit.cloud/spaces/402448/join?access_code=fwlAcK_yXIxTkegUsEBSHHINTmR8PtC2tQX8oIly "Text Mining Workshop Link") to join today's Posit Cloud work space. **Computing resources have been allocated only for workshop attendees.**
## GitHub Links
This website has been built from a GitHub repository located at <https://github.com/mattfarrow1/apra-intro-to-text-mining>. It contains everything used in the workshop including slides, data, scripts, and resources.
There is a second repository located at <https://github.com/mattfarrow1/text-mining-workshop> that contains only the data and scripts for the workshop. This is the repository that the Posit Cloud work space connects to.
# Overview
Free text, or unstructured data, is found throughout our CRMs in places like contact reports, notes, research profiles, emails, and more. In this workshop, we'll look at ways to get started with text mining in order to begin answering questions, surfacing insights, understanding sentiment, and bringing more value to your work. The workshop will be split into sections on Excel, R, and Python and will allow attendees the opportunity to work through examples and takeaway ideas for implementing these concepts at their own organizations.
## Competencies
| | |
|---------------|-----------------------------------------------------------------------|
| **Primary** | DS: Competency 3: Data Manipulation Skills |
| **Auxiliary** | DS: Competency 5: Visualization/Reporting Techniques and Competencies |
| **Tertiary** | DS: Competency 4: Statistical Techniques and Competencies |
## Learning Objectives
| | |
|--------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------|
| **Learning Objective 1** | Attendees will create word counts and keyword searches in Excel. |
| **Learning Objective 2** | Attendees will also utilize R (possibly Python too) to work on advanced text mining including stop words, topic modeling, word clouds, and sentiment analysis. |
> These materials are developed for a trainer-led workshop at Apra PD 2023, but should be useful for self-guided learning as well.
## Instructor
Matt Farrow is the Director of Prospect Development and Strategy at Children's Health. He previously worked as the Assistant Director of Prospect Management at Southern Methodist University, where he earned an M.S. degree in Data Science. Matt is passionate about helping people extract insights and wisdom from their data. He may also have a small problem with his love of LEGO.
Email: [matt.farrow\@childrens.com](mailto:matt.farrow@childrens.com)
------------------------------------------------------------------------
![](https://i.creativecommons.org/l/by/4.0/88x31.png) This work is licensed under a [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/).