Need a regular expression to recognize the 50 US state names or USPS abbrevations? Here it is!
Continue reading “Regex for 50 US States”emoji4j v15.0.1 Released
A new version v15.0.1 of my emoji processing library, emoji4j, for Java 8+ just dropped. Here are the updates:
- Update to Unicode 15
- New method
GraphemeMatcher#results()
- Imroved documentation
- Even more tests
There is also now a Cookbook in the emoji4j wiki to help users solve hard or common problems with emoji4j.
Enjoy!
Parsing Rich Social Media Text
If you ever need to convert “plain” social media text like:
Hey @importantguy, check out my project https://www.mycoolproject.com/ #PrettyPlease
into “rich” social media text like:
Hey @importantguy, check out my project https://www.mycoolproject.com/ #PrettyPlease
Then you need a social text parsing library. The industry standard is twitter/twitter-text
, but it doesn’t work for everything. (For example, it only parses valid Twitter mentions, but will not parse all valid TikTok mentions, since those screen names can contain a “.”.) So while you may need to customize for your specific use case(s), this post should at least give you a good starting point.