Sentiment Analysis of IMDB Movie Reviews Using Recurrent Neural Network Algorithm
Abstract
IMDb is a well-known platform that provides user reviews and ratings of various movies. The number of reviews found on IMDb is quite large, reaching thousands of reviews. Although a movie can have a high overall rating, it is still possible to receive negative reviews from some viewers. Therefore, the purpose of this sentiment classification system is to provide a benchmark for the level of sentiment contained in the movie, and hope that filmmakers can use this information as a reference in the development of their next movie. In this research, reviews from IMDb users are classified into two types, namely positive reviews and negative reviews. The program was created using the Python language with the LSTM (Long Short-Term Memory) classification model of the RNN (Recurrent Neural Network) algorithm. The purpose of using this algorithm is to measure the level of prediction accuracy in the classification process. The results of three test ratios, namely 60:40, 70:30, and 80:20, show that in the scenario of 80% data training and 20% data testing has better performance with the results accuracy of 96%, precision of 97%, recall of 98%, f1-score of 97%.
Downloads
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlike International License (CC-BY-SA 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Copyright without Restrictions
The journal allows the author(s) to hold the copyright without restrictions and will retain publishing rights without restrictions.
The submitted papers are assumed to contain no proprietary material unprotected by patent or patent application; responsibility for technical content and for protection of proprietary material rests solely with the author(s) and their organizations and is not the responsibility of the ULTIMATICS or its Editorial Staff. The main (first/corresponding) author is responsible for ensuring that the article has been seen and approved by all the other authors. It is the responsibility of the author to obtain all necessary copyright release permissions for the use of any copyrighted materials in the manuscript prior to the submission.