Text-Mining-DataCamp-Analyzing Social Media Data in R

1. Understanding Twitter Data

1.1 Analyzing twitter data (video)

1.2 Power of twitter data

Instruction:

# Load rtweet, which provides stream_tweets() (requires Twitter API credentials)
library(rtweet)

# Extract live tweets for a 120-second window
tweets120s <- stream_tweets("", timeout = 120)

# View dimensions of the data frame with live tweets
dim(tweets120s)

1.3 Pros and cons of twitter data

1.4 Extracting twitter data (video)

1.5 Prerequisites to set up the R environment

1.6 Search and extract tweets

Instruction:

# Extract tweets on "#Emmyawards" and include retweets
twts_emmy <- search_tweets("#Emmyawards", 
                 n = 2000, 
                 include_rts = TRUE, 
                 lang = "en")

# View output for the first 5 columns and 10 rows
head(twts_emmy[,1:5], 10)

1.7 Search and extract timelines

Instruction:

# Extract tweets posted by the user @Cristiano
get_cris <- get_timeline("@Cristiano", n = 3200)

# View output for the first 5 columns and 10 rows
head(get_cris[,1:5], 10)

1.8 Components of twitter data (video)

1.9 User interest and tweet counts

Instruction:

# Create a table of users and tweet counts for the topic
sc_name <- table(tweets_ai$screen_name)

# Sort the table in descending order of tweet counts
sc_name_sort <- sort(sc_name, decreasing = TRUE)

# View sorted table for top 10 users
head(sc_name_sort, 10)

1.10 Compare follower count

Instruction:

# Extract user data for the Twitter accounts of 4 news sites
# (lookup_users() takes one character vector of screen names)
users <- lookup_users(c("nytimes", "CNN", "FoxNews", "NBCNews"))

# Create a data frame of screen names and follower counts
user_df <- users[,c("screen_name","followers_count")]

# Display and compare the follower counts for the 4 news sites
user_df

1.11 Retweet counts

Instruction 1:

# Create a data frame of tweet text and retweet count
rtwt <- tweets_ai[,c("text", "retweet_count")]
head(rtwt)

# Sort data frame in descending order of retweet counts
# (arrange() and desc() come from dplyr)
library(dplyr)
rtwt_sort <- arrange(rtwt, desc(retweet_count))

Instruction 2:

# Create a data frame of tweet text and retweet count
rtwt <- tweets_ai[,c("text", "retweet_count")]
head(rtwt)

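# Sort data frame in descending order of retweet counts
# (arrange() and desc() come from dplyr)
library(dplyr)
rtwt_sort <- arrange(rtwt, desc(retweet_count))

# Exclude rows with duplicate text from the sorted data frame.
# unique() has no `by` argument for data frames, so use dplyr::distinct() on the text column
rtwt_unique <- distinct(rtwt_sort, text, .keep_all = TRUE)

# Print the 6 unique posts with the most retweets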
rownames(rtwt_unique) <- NULL
head(rtwt_unique)
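
Deduplicating on one column behaves differently from deduplicating whole rows. The sketch below illustrates this on a made-up data frame; `toy` and its values are hypothetical and only stand in for the sorted tweet data. It uses base R only, so it is one possible approach rather than the course's own solution.

```r
# Hypothetical toy data standing in for a data frame sorted by retweet count
toy <- data.frame(
  text = c("AI is here", "AI is here", "Robots rule"),
  retweet_count = c(50, 40, 10),
  stringsAsFactors = FALSE
)

# unique() on a data frame compares whole rows, so both "AI is here" rows
# survive because their retweet counts differ
whole_row <- unique(toy)

# duplicated() on the text column keeps only the first row per text,
# which is the most retweeted one because the data is already sorted
by_text <- toy[!duplicated(toy$text), ]
rownames(by_text) <- NULL

print(nrow(whole_row))
print(by_text)
```

Because the rows are sorted before deduplicating, the copy that survives for each text is the one with the highest retweet count.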

2. Analyzing Twitter Data

2.1 Filtering tweets (video)

2.2 Filtering for original tweets

Instruction:

# Extract 100 original tweets on "Superbowl"
tweets_org <- search_tweets("Superbowl -filter:retweets -filter:quote -filter:replies", n = 100)

# Check for presence of replies
# (plyr::count() tabulates a vector; dplyr::count() expects a data frame)
plyr::count(tweets_org$reply_to_screen_name)

# Check for presence of quotes
plyr::count(tweets_org$is_quote)

# Check for presence of retweets
plyr::count(tweets_org$is_retweet)

2.3 Filtering on tweet language

2.4 Filter based on tweet popularity
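
As a rough illustration of popularity filtering, the sketch below keeps only rows that meet minimum retweet and favorite counts. It works on a made-up data frame after extraction, which is an assumption: the exercise itself may filter at query time instead. The names `toy_tweets`, `min_rt`, and `min_fav` are hypothetical; `retweet_count` and `favorite_count` mirror the column names rtweet returns.

```r
# Hypothetical toy data standing in for extracted tweets
toy_tweets <- data.frame(
  text = c("a", "b", "c", "d"),
  retweet_count = c(150, 20, 300, 101),
  favorite_count = c(120, 500, 90, 100),
  stringsAsFactors = FALSE
)

# Assumed popularity thresholds (illustrative values only)
min_rt <- 100
min_fav <- 100

# Keep tweets that meet both thresholds
popular <- subset(
  toy_tweets,
  retweet_count >= min_rt & favorite_count >= min_fav
)

print(popular)
```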

2.5 Twitter user analysis

2.6 Extract user information

2.7 Explore users based on the golden ratio
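
A minimal sketch of ranking users by a follower ratio, assuming "golden ratio" here means followers divided by friends (accounts followed). The data frame `toy_users` and the column `follow_ratio` are hypothetical; `followers_count` and `friends_count` mirror rtweet's user columns.

```r
# Hypothetical toy data standing in for extracted user information
toy_users <- data.frame(
  screen_name = c("user_a", "user_b", "user_c"),
  followers_count = c(1000, 50, 300),
  friends_count = c(100, 500, 300),
  stringsAsFactors = FALSE
)

# Followers-to-friends ratio (assumed definition of the "golden ratio")
toy_users$follow_ratio <- toy_users$followers_count / toy_users$friends_count

# Sort users from highest to lowest ratio
sorted <- toy_users[order(-toy_users$follow_ratio), ]

print(sorted[, c("screen_name", "follow_ratio")])
```

With real data, an account that follows nobody has `friends_count` of 0 and yields `Inf`, so such rows may need separate handling.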

2.8 Subscribers to twitter lists

2.9 Twitter trends

2.10 Available trends

2.11 Trends by country name

2.12 Trends by city and most tweeted trends

2.13 Plotting twitter data over time

2.14 Visualizing frequency of tweets

2.15 Create time series objects

2.16 Compare tweet frequencies for two brands

3. Visualize Tweet Texts

3.1 Processing twitter text

3.2 Remove URLs and characters other than letters
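
One way to do this cleaning step in base R is with `gsub()`, sketched below on two made-up strings. The vector `toy_text` and the exact patterns are assumptions, not the course's solution; the URL pattern simply matches `http://` or `https://` followed by non-space characters.

```r
# Hypothetical toy tweets
toy_text <- c(
  "Check this out https://t.co/abc123 #AI2020!",
  "R is gr8 :) http://example.com"
)

# Step 1: remove URLs
no_urls <- gsub("https?://\\S+", "", toy_text, perl = TRUE)

# Step 2: replace every character that is not a letter with a space
letters_only <- gsub("[^A-Za-z]", " ", no_urls)

# Step 3: collapse runs of spaces and trim the ends
clean <- trimws(gsub(" +", " ", letters_only))

print(clean)
```

Removing URLs first matters: if non-letters were stripped first, fragments such as "https" and "t co" would remain in the text as words.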

3.3 Build a corpus and convert to lowercase

3.4 Remove stop words and additional spaces

3.5 Visualize popular terms

3.6 Removing custom stop words

3.7 Visualize popular terms with bar plots

3.8 Word clouds for visualization

3.9 Topic modeling of tweets

3.10 The LDA algorithm

3.11 Create a document term matrix

3.12 Create a topic model

3.13 Twitter sentiment analysis

3.14 Extract sentiment scores

3.15 Perform sentiment analysis

4. Network Analysis and Putting Twitter Data on the Map

4.1 Twitter network analysis

4.2 Preparing data for a retweet network

4.3 Create a retweet network

4.4 Network centrality measures

4.5 Calculate out-degree scores
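
A minimal base R sketch of out-degree on a retweet edge list, assuming each edge runs from the user who retweets (`from`) to the user whose tweet was retweeted (`to`). The data frame `edges` and its column names are hypothetical. A graph library such as igraph would normally compute this from a graph object; counting rows per source user is used here only to show what the score measures.

```r
# Hypothetical retweet edge list: `from` retweeted a post by `to`
edges <- data.frame(
  from = c("ann", "ann", "bob", "cat", "ann"),
  to   = c("bob", "cat", "cat", "dan", "dan"),
  stringsAsFactors = FALSE
)

# Out-degree = number of outgoing edges per user, i.e. how often they retweet
out_degree <- sort(table(edges$from), decreasing = TRUE)

print(out_degree)
```

Users with no outgoing edges (here `dan`) do not appear in the table at all; a graph-based calculation would report them with a score of 0.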

4.6 Compute the in-degree scores

4.7 Calculate the betweenness scores

4.8 Visualizing twitter networks

4.9 Create a network plot with attributes

4.10 Network plot based on centrality measure

4.11 Follower count to enhance the network plot

4.12 Putting twitter data on the map

4.13 Extract geolocation coordinates

4.14 Twitter data on the map

4.15 Course wrap-up
