-
Notifications
You must be signed in to change notification settings - Fork 3
/
blog5.html
99 lines (75 loc) · 6.01 KB
/
blog5.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
<!DOCTYPE html>
<html>
<head>
<meta charset='utf-8'>
<meta http-equiv="X-UA-Compatible" content="chrome=1">
<meta name="description" content="CS1951a : Data Science Project">
<link rel="stylesheet" type="text/css" media="screen" href="stylesheets/stylesheet.css">
<title>Blog Post 5</title>
<style>
#nokey {
z-index:-99;
top: 0;
left: -60%;
position: absolute;
height: 100%;
width: 215%;
}
</style>
</head>
<body>
<!-- HEADER -->
<div id="header_wrap" class="outer">
<header class="inner">
<a id="forkme_banner" href="https://github.com/pengyangwu/CS1951a">View on GitHub</a>
<h1 id="project_title">Blog Post #4</h1>
<h2 id="project_tagline">CS1951a : Data Science Project</h2>
<section id="downloads">
<a class="zip_download_link" href="https://github.com/pengyangwu/CS1951a/zipball/master">Download this project as a .zip file</a>
<a class="tar_download_link" href="https://github.com/pengyangwu/CS1951a/tarball/master">Download this project as a tar.gz file</a>
</section>
</header>
</div>
<!-- MAIN CONTENT -->
<div id="main_content_wrap" class="outer">
<section id="main_content" class="inner">
<h2>
<a id="command-line-youtube-data" class="anchor" href="#" aria-hidden="true"><span class="octicon octicon-link"></span></a><font color='#ff6600'>YouTube Recommendation System with Data Trends Analysis using YouTube API</font></h2>
<h4>
<a id="author-Aaron-Abhishek-Natalie-Preston-Wennie" class="anchor" href="#" aria-hidden="true"><span class="octicon octicon-link"></span></a>Author: Aaron Wu (pwu8), Abhishek Dutta (adutta2), Natalie Roe (nroe), Preston Law (plaw), Wennie Zhang (yzhang46)</h4>
<br>
<h3>
<a id="content"><span class="octicon octicon-link"></span></a>Summary</h3>
<p>We finally finished our project and presented it at the poster fair today YAY! In the end, we ended up NLTK to perform sentiment analysis on the comments of the top 50 youtube videos in our data set. We then show how the sentiments relate to the videos in our data through visualizations and the web application we created.
</p>
<img src="images/Poster_Updated/Slide1.jpg">
<p> Our web application uses JavaScript, HTML and CSS to provide as user-friendly way for users to see the tags that our predictive model assigned the videos in our data set as well as the sentiment associated with the video. For more information about how to run the web app, please refer to the README in the webapp folder of the github.
</p>
<img src="images/line_viz.png">
<p>This is one visualization that shows the sentiments of the top 10 videos in our data set trend over time. Moving the mouse along the visualization displays the video ids of each video. For more information about our visualizations please refer to the final report. </p>
<img src="images/webapp.png" align="middle">
<p>We also created a visualization that displays the number of uploads to YouTube per state in the USA. We were able to create this visualization using the longitude and latitude features associated with each video that we collected from the YouTube Data API. Using this information, we are able to visualize which states have populations that upload to YouTube more frequently than others. Interestingly, Florida is the state with the most uploads and surpasses more heavily-populated states such as California and Texas. Dragging the cursor along the bottom of the visualization enables the user to see which states have a certain number of uploads to YouTube. Please open up the visualization in a browser to see this effect.</p>
<img src="images/map_viz.png" align="middle">
<h3>
<a id="content"><span class="octicon octicon-link"></span></a>Challenges</h3>
<p>Overall, these are the stats for the predicted tags on our videos: Recall: 70.7326%, Precision: 94.2882%, F1 score: 75.6736%.</p>
<p>One of the challenges we encountered during this project was the quota limit on the YouTube API, which forced us to greatly limit the number of videos we could run our predictive model on. Nevertheless, we still got some good results despite our limitations. Another challenge the fact that we used user-defined tags to help build our predictive model. User defined tags can be very random, poorly formatted, and not actually indicative of the content of the video. In addition, the frequency of these user-defined tags is very less and it is hard to find overlapping tags to train our model (Machine Learning) to predict tags for future videos. Overcoming this obstacle was one of our biggest challenges, and we are glad that our model was still able to yield good results despite this.
</p>
<h3>
<a id="content"><span class="octicon octicon-link"></span></a>Implications</h3>
<p>We hope that in the future our web application can be used to help users become aware of the sentiments associated with a particular video with a tag before they watch the video. If a video has very negative sentiments, we hope to give the viewer a warning before they select to watch the video. This could be beneficial for parents with young children or for individuals who want to be contientious about the content that they are viewing on the web.</p>
<p>Home Page # is: <a href="https://pengyangwu.github.io/CS1951a/"> here </a>.</p>
<canvas id="nokey" width="20" height="20">If Empty, Means Your Browser Don't Support Canvas. Please Use Chrome Browser ^_^``</canvas>
<script src="javascripts/index2_line.js"> </script>
<script src='http://cdnjs.cloudflare.com/ajax/libs/jquery/2.1.3/jquery.min.js'></script>
</section>
</div>
<!-- FOOTER -->
<div id="footer_wrap" class="outer">
<footer class="inner">
<p class="copyright"> This page created and maintained by <a href="https://github.com/pengyangwu/CS1951a">Aaron Wu, Abhishek Dutta, Natalie Roe and Wennie Zhang</a></p>
<p>© 2017 YouTube Data Science Team. <a href="https://youtubedatascience.wixsite.com/youtubedatasci">See team members.</a></p>
</footer>
</div>
</body>
</html>