(Regular Expressions) RegEx to clean text| Mind-blowing ways to clean up your text in 2021!

RegEx to clean text

RegEx to Clean Text

I used to work in Market Research and would spend hours cleaning up labels from surveys I’d programmed in online tools. The below video shows how easy it can be if you understand regular expressions and a little programming.

Regular Expressions can have a steep learning curve however it is really worth it if you continually get data that you need to clean.  You can also check out this post of mine showing how they can be used to automate your metrics.

RegEx to clean text demo

A handy cheat-sheet is downloadable from here and a wonderful course from Kevin Skoglund available from Lynda.com is available here.  Below is a great intro to the Regular Expressions.

Regex to clean text in preparation for word count in PHP – Code …

Jan 25, 2014 … a question; Anybody can answer; The best answers are voted up and rise to the top. Regex to clean string in preparation for word count in PHP …

php – Match all youtube links in a string of text – Code Review Stack …

Apr 7, 2015 Regular expressions to clean text in preparation for word count in PHP · 0 · Remove parameters from string containing URL · 4 · Normalizing strings using …

php – Regex to remove inline javascript from string – Code Review …

Aug 21, 2013 Regex to clean text in preparation for word count in PHP · 2 · Regex-ing an array · 2 · Removing stray brackets from in between shortcodes.

Web Scraping with AutoHotKey 101-Super simple ways to Get data from a page, handles & pointers

Web Scraping with AutoHotKey

Web Scraping with AutoHotkey: Intro

Being able to, programatically, navigate to an Internet page and scrape the contents in a reliable fashion is best things invented since sliced bread!   I spent years manually going through pages and copying/pasting contents from IE to Excel then spent even more time trying to clean it up.  Done properly you can get the data very, very close to how it is on the web with little effort.

The below video walks through using AutoHotKey to obtain basic values from a Web page.  It also demonstrates a script I wrote that helps write the syntax (yes I’m that lazy!)  The AutoHotKey script I wrote is further down this page and can also be found on the AHK forum here.

In this beginning tutorial I how to:
1) get a pointer to IE
2) navigate to a page
3) get text from a page

Web Scraping Intro with AutoHotkey

Here is the script writer to use during your web scraping intro with AutoHotkey.
Web Scraping with AutoHotKey

Web Scraping with AutoHotKey 1.5- troubleshooting web scraping

Web Scraping with AutoHotKey: troubleshootingWeb Scraping with AutoHotKey: Troubleshooting Web Scraping

When building my first scraping scripts I used to waste a ton of time trying to figure out what was broken.  Adding some structure to your diagnoses process can greatly speed-up detecting what has gone wrong.   A copy of the AutoHotKey syntax writer can be found here.

I think some excellent advice, not exclusive to troubleshooting web scraping, is to have a bobble-doll or something to talk to.  Pretend you’re explaining your issue to a friend and often, when you hear yourself say the words, your issue will appear to you.

This video offers some general troubleshooting tips around troubleshooting web scraping when using AutoHotKey.

Web Scraping with AutoHotKey: Troubleshooting

How to use SciTE messages to control SciTE with AutoHotkey | 63 Extremely powerful messages to control SciTE

How to use SciTE messages

How to use SciTE messages

SciTE is a great IDE that I use with AutoHotKey, SPSS, SQL, Python, XML, HTML, etc.   I love being able to use regular expressions in it to manipulate text and it has some very cool capabilities.  This video is one of my favorite demonstrations how powerful SciTE can be at manipulating text.

Here is a short tutorial and demonstration on how to manipulate SciTE editor via COM objects and Windows commands with AutoHotKey.

Tutorial How to use SciTE messages with AutoHotkey

How to customize SciTE

See the list of SciTE commands here

Take a deep-dive into Scintilla documentation

Look at the Director Interface options

To send messages in AutoHotkey review MSDN

FYI- 0x111 is the WM_COMMAND

SciTE Messages for use in AutoHotkey with a COM object

You can use spy to find wm_command

A specific version of the SciTE editor for AutoHotKey can be downloaded here and more generic documentation can be found here.