Short From AI based Sentiment Analysis

By Oliver D'Esposito

Email: oliverdesposito@gmail.com

What is the ideal AI model for short text sentiment analysis?

Working Overview

Sentiment Analysis

A computer determining the emotional value of a body of text.

A Example

"was taking pictures in the UK and beautiful this fan walked up to me"

A Example

"was taking pictures in the UK and beautiful this fan walked up to me"

"taking pictures in the UK and horrible this fan walked up to me"

A Example

"was taking pictures in the UK and beautiful this fan walked up to me"

"taking pictures in the UK and horrible this fan walked up to me"

A Example

1.0

0.0

Neural Network

A type of ML which uses interconnected units (mathematical functions) to learn features of data. These units are organized in layers.

Neural Network

Forward Feed Neural Network (FFNN): A network in which data is only moved forward

Recurrent Neural Network (RNN): Data can be recalled therefor making it effective for sequential data

Convolutional Neural Network (CNN): recognizes patterns

Long Short Term Memory (LSTM): A RNN that fixed vanishing gradient

Gradient Recurrent Neural Network (GRU): A simper and faster version of LSTM

Why this topic

5,700

Tweets per second

192,000,000

monetizable daily active users

Social Impact

#BML & #MeToo

Other Uses

E-Commerce
Ad Campaigns
Comments
Hate

Applied Project

STEM Thesis

Goal

Collect and analyze Tweets

My process

API
CRON job
Database

How to know the sentiment of a tweet

How to get sentiment


						"#AEWRevolution 😃 -😭 -😢 -😥 -😞 -☹️ -😡"
						"#BBMA ☹️"

Extracting information

Tensorflow
Numpy
MatPlotlib

Lessons learned

Backups
Tests
Docker

Code


class trend_query_genertor():
    def __init__(self, emoji_querys, trends) -> None:
        self.emoji_querys = emoji_querys
        self.trends = trends.__iter__()

    def __iter__(self):
        return self

    def __next__(self):
        query_obj_list = []

        trend_search_term = self.trends.__next__()
        output_objs = []

        for emoji_obj in self.emoji_querys:
            trend_obj = emoji_obj.copy()

            trend_obj['query'] = f"{trend_search_term} {emoji_obj['query']} -is:retweet"            
            trend_obj['trend'] = trend_search_term

            output_objs.append(trend_obj)

        return  output_objs

data conclusions

Word Count

37.6 Average

21 Median

Paper conclusions

CNNs are best fitted for short text sentiment analysis

Opposing view

Basic models should be used
Sequence is needed for Twitter analysis

CNNs are best fitted for short text sentiment analysis

CNNs are best fitted for short text sentiment analysis

Future questions

How would a change in dialects effect my conclusion?

Is sequence more important for cultural nuance?

References

About Keras [Fact sheet]. (n.d.). Keras.io. Retrieved December 15, 2021, from https://keras.io/

     about/

Anderson, M., Barthel, M., Vogels, E. A., & Perrin, A. (2020, June). #BlackLivesMatter surges on

     Twitter after George Floyd's death. Pew Reshearch Center. Retrieved October 13, 2021, from

     https://www.pewresearch.org/fact-tank/2020/06/10/

     blacklivesmatter-surges-on-twitter-after-george-floyds-death/

Assenmacher, D., Adam, L., Trautmann, H., & Grimme, C. (2020, May). Towards real-time and

     unsupervised campaign detection in social media. In The Thirty-Third International Flairs

     Conference.

Batrinca, B., & Treleaven, P. C. (2014). Social media analytics: A survey of techniques, tools and

     platforms. AI & SOCIETY, 30(1), 89-116. https://doi.org/10.1007/s00146-014-0549-4

BLAS (Basic Linear Algebra Subprograms) [Fact sheet]. (29, June 6). netlib.org. Retrieved December

     15, 2021, from https://www.netlib.org/blas/

Browlee, J. (2019, August 12). A Tour of Machine Learning Algorithms.

     https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/

References

Chollet, F. (n.d.). Dropout layer. Keras.io/. Retrieved December 15, 2021, from https://keras.io/api/

     layers/regularization_layers/dropout/

Dettmers, T. (2015, November 3). Deep Learning in a Nutshell: Core Concepts. Deveoper Blog.

     https://developer.nvidia.com/blog/deep-learning-nutshell-core-concepts/

Ghannay, S., Favre, B., Est�ve, Y., & Camelin, N. (2016). Word Embedding Evaluation and

     Combination. LREC.

Ghelani, S. (2019, June 2). Text Classification � RNN's or CNN's? Towards Data Science.

     https://towardsdatascience.com/text-classification-rnns-or-cnn-s-98c86a0dd361

Gundecha, P., & Liu, H. (2012). Mining social media: A brief introduction. 2012 TutORials in

     Operations Research, 1-17. https://doi.org/10.1287/educ.1120.0105

Hussein, D. M. E.-D. M. (2018). A survey on sentiment analysis challenges. Journal of King Saud

     University - Engineering Sciences, 30(4), 330-338. https://doi.org/10.1016/J.JKSUES.2016.04.002

Jamal, Xianqiao, & Aldabbas. (2019). Deep learning-based sentimental analysis for large-scale

     imbalanced twitter data. Future Internet, 11(9), 190. http://dx.doi.org/10.3390/fi11090190

References

Khan, M., Malviya, A., & Yadav, S. K. (2020). Big data and social media analytics- A challenging

     approach in processing of big data. Lecture Notes in Electrical Engineering, 611-622.

     https://doi.org/10.1007/978-981-15-7961-5_59

Kirkorian, R. (2013, August 16). New Tweets per second record, and how! Twitter engineering.

     https://blog.twitter.com/engineering/en_us/a/2013/new-tweets-per-second-record-and-how

Maheshwari, A., & OpenGenus Foundation. (n.d.). SMOTE for Imbalanced Dataset. OpenGenus.

     https://iq.opengenus.org/smote-for-imbalanced-dataset/

Mehedi shamrat, F. M. J., Chakraborty, S., Imran, M. M., Muna, J. N., Billah, M. M., Das, P., &

     Rahman, M. O. (2021). Sentiment analysis on twitter tweets about covid-19 vaccines usi ng NLP

     and supervised KNN classification algorithm. Indonesian Journal of Electrical Engineering and

     Computer Science, 23(1), 463. https://doi.org/10.11591/ijeecs.v23.i1.pp463-470

Mijwil, Maad. (2018). Artificial Neural Networks Advantages and Disadvantages.

References

Nvidia. (2021). CUDA Toolkit. Nvidia.com. https://developer.nvidia.com/cuda-toolkit

Prabhakaran, S. (2019, March 23). Principal Component Analysis (PCA) � Better Explained.

     machine Learning +. https://www.machinelearningplus.com/machine-learning/

     principal-components-analysis-pca-better-explained/

Socher, R., Perelygin, A., Wu, J., Chuang, J., Manning, C. D., Ng, A. Y., & Potts, C. (2013,

     October). Recursive deep models for semantic compositionality over a sentiment treebank. In

     Proceedings of the 2013 conference on empirical methods in natural language processing (pp.

     1631-1642).

Staudemeyer, R. C., & Morris, E. R. (2019). Understanding LSTM--a tutorial into Long Short-Term

     Memory Recurrent Neural Networks. arXiv preprint arXiv:1909.09586.

T. Makkar, Y. Kumar, A. K. Dubey, �. Rocha and A. Goyal, "Analogizing time complexity of KNN and

     CNN in recognizing handwritten digits," 2017 Fourth International Conference on Image

     Information Processing (ICIIP), 2017, pp. 1-6, doi: 10.1109/ICIIP.2017.8313707.

References

Tan, W., Wang, X., & Xu, X. (2018). Sentiment Analysis for Amazon Reviews.

     https://cs229.stanford.edu/proj2018/report/122.pdf

Twitter Inverstor Relations. (2021, October 26). Selected Company Metrics and Financials [Table].

     Twitter Investor Relations. https://s22.q4cdn.com/826641620/files/doc_financials/2021/q3/

     Final-Selected-Metrics-and-Financials.pdf

Varol, O., Ferrara, E., Menczer, F., & Flammini, A. (2017). Early detection of promoted campaigns on

     social media. EPJ Data Science, 6(1). http://dx.doi.org/10.1140/epjds/s13688-017-0111-y

West, S. (2021, July 22). Digital Activism: Social Movement on Social Media (M. Li, Ed.). Psychology

     Today. Retrieved October 13, 2021, from https://www.psychologytoday.com/us/blog/

     understanding-the-social-world/202107/digital-activism-social-movement-social-media

Whiting, J. (2020). Tweets show what hinders reports of sexual assault and harassment on campus -

     and why the new federal title IX rules may be a step back. The Conversation : Education.

References

Why TensorFlow [Fact sheet]. (n.d.). TensorFlow.org. Retrieved December 15, 2021, from

     https://www.tensorflow.org/about

Yin, W., Kann, K., Yu, M., & Sch�tze, H. (2017). Comparative study of CNN and RNN for natural

     language processing. arXiv preprint arXiv:1702.01923.

Zhang, K. (2018). LSTM: An Image Classification Model Based on Fashion-MNIST Dataset.

Zhuang, Y., Thiagarajan, A., & Sweeney, T. (2019, March 4). Ranking Tweets with TensorFlow.

     TensorFlow Blog. https://blog.tensorflow.org/2019/03/ranking-tweets-with-tensorflow.html

Thank you

Short From AI based Sentiment Analysis

Working Overview

Sentiment Analysis

A Example

A Example

A Example

A Example

A Example

Neural Network

Neural Network

Why this topic

5,700

192,000,000

Social Impact

Other Uses

Applied Project

STEM Thesis

Goal

Extracting information

Lessons learned

Code

data conclusions

Word Count

Word Count

Paper conclusions

Opposing view

Future questions

References

References

References

References

References

References

Questions?