This application extracts full URLs from dataframes of tweets and creates subsets with additional information based on URL type (websites, Twitter, YouTube, Telegram).

JessDataNLP/Twitter-URL-extractor

Twitter-URL-extractor: URL Expansion and Classification Tool

This Python package provides tools for expanding shortened URLs, classifying URLs by domain, extracting account names from Telegram URLs, and retrieving YouTube video titles from URLs. It aims to simplify the analysis of hyperlinks contained in tweet text, in support of content-based or hyperlink-based network analysis.

Features

  • URL expansion from shortened forms.
  • Classification of URLs into categories: YouTube, Twitter, other social media, Telegram, or generic websites.
  • Extraction of Telegram account names from URLs.
  • Retrieval of YouTube video titles and account names.
  • Retrieval of Twitter account names from URLs.

More coming soon.
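
To make the classification and account-extraction features more concrete, here is a minimal sketch using only the Python standard library. It is not this package's actual API: the function names (`classify_url`, `telegram_account`, `twitter_account`), the category labels, and the domain lists are all assumptions chosen for illustration, and the package's own rules may differ. It also assumes the URLs have already been expanded.

```python
from typing import List, Optional
from urllib.parse import urlparse

# Hypothetical domain lists for illustration; the package's real rules may differ.
OTHER_SOCIAL_DOMAINS = {"facebook.com", "instagram.com", "tiktok.com", "linkedin.com"}
# Twitter paths whose first segment is not an account name (assumed, not exhaustive).
TWITTER_RESERVED = {"i", "search", "hashtag", "home", "intent", "share"}


def _host(url: str) -> str:
    """Return the lowercased host with one common subdomain prefix removed."""
    host = (urlparse(url).hostname or "").lower()
    for prefix in ("www.", "m.", "mobile."):
        if host.startswith(prefix):
            return host[len(prefix):]
    return host


def _segments(url: str) -> List[str]:
    """Return the non-empty path segments of a URL."""
    return [s for s in urlparse(url).path.split("/") if s]


def classify_url(url: str) -> str:
    """Assign a URL to one of five coarse categories based on its host."""
    host = _host(url)
    if host in {"youtube.com", "youtu.be"}:
        return "youtube"
    if host in {"twitter.com", "x.com"}:
        return "twitter"
    if host in {"t.me", "telegram.me"}:
        return "telegram"
    if host in OTHER_SOCIAL_DOMAINS:
        return "other_social"
    return "website"


def telegram_account(url: str) -> Optional[str]:
    """Return the channel or account name from a Telegram URL, if present."""
    if classify_url(url) != "telegram":
        return None
    segments = _segments(url)
    # t.me/s/<channel> is the public web-preview form of a channel link.
    if segments and segments[0] == "s":
        segments = segments[1:]
    # Invite links (joinchat/... or +...) carry no account name.
    if not segments or segments[0] == "joinchat" or segments[0].startswith("+"):
        return None
    return segments[0]


def twitter_account(url: str) -> Optional[str]:
    """Return the account name from a Twitter URL, if the path starts with one."""
    if classify_url(url) != "twitter":
        return None
    segments = _segments(url)
    if not segments or segments[0].lower() in TWITTER_RESERVED:
        return None
    return segments[0].lstrip("@")
```

On a dataframe of tweets, functions like these could be applied to a column of expanded URLs (for example with pandas `Series.map`) to build the per-type subsets described above. Retrieving YouTube titles and expanding shortened links both require network requests and are left out of this sketch.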
