Leveraging Big Data Analytics and Machine Learning Techniques for Sentiment Analysis of Amazon Product Reviews in Business Insights
DOI:
https://doi.org/10.47672/ajce.2612Keywords:
Sentiment analysis, big data, business, Machine learning, Amazon product review dataset, BOWAbstract
Purpose: Satisfactory consumer feedback results from sentiment research which enables product quality enhancement. The research examines Amazon product review data through machine learning methods for sentiment analysis to extract important insights that improve customer experience.
Materials and Methods: A Gradient Boost Classifier stands at the core of the proposed method which conducts sentiment analysis operations. The preliminary data treatment includes punctuation removal and stop word filtering followed by text tokenization. Feature extraction is performed using the Bag of Words (BoW) technique. The data is split into training and testing sets, and the models are evaluated using F1-score, recall, accuracy, and precision. Comparative analysis is conducted with Logistic Regression (LR), Naïve Bayes (NB), and Recursive Neural Network for Multiple Sentences (RNNMS).
Findings: Among the tested models, the Gradient Boost Classifier consistently outperforms others, achieving a robust performance of 82% across all evaluation metrics. This highlights its superior classification capability in sentiment analysis tasks.
Unique Contributions to Theory, Practice and Policy: While Gradient Boosting demonstrates high accuracy, future research could explore more advanced models and techniques, such as transformer-based architectures, to enhance sentiment classification across diverse product categories and address more nuanced sentiment patterns
Downloads
References
A. Ejaz, Z. Turabee, M. Rahim, and S. Khoja, “Opinion mining approaches on Amazon product reviews: A comparative study,” in 2017 International Conference on Information and Communication Technologies, ICICT 2017, 2017. doi: 10.1109/ICICT.2017.8320185.
V. V. Kumar, S. R. Yadav, F. W. Liou, and S. N. Balakrishnan, “A digital interface for the part designers and the fixture designers for a reconfigurable assembly system,” Math. Probl. Eng., 2013, doi: 10.1155/2013/943702.
V. Kumar, V. V. Kumar, N. Mishra, F. T. S. Chan, and B. Gnanasekar, “Warranty failure analysis in service supply Chain a multi-agent framework,” in SCMIS 2010 - Proceedings of 2010 8th International Conference on Supply Chain Management and Information Systems: Logistics Systems and Engineering, 2010.
X. Fang and J. Zhan, “Sentiment analysis using product review data,” J. Big Data, 2015, doi: 10.1186/s40537-015-0015-2.
S. Vanaja and M. Belwal, “Aspect-Level Sentiment Analysis on E-Commerce Data,” in Proceedings of the International Conference on Inventive Research in Computing Applications, ICIRCA 2018, 2018. doi: 10.1109/ICIRCA.2018.8597286.
S. Paknejad and A. Gravano, “Sentiment classification on Amazon reviews using machine learning approaches,” Degree Proj. Technol., vol. 70, no. 8, pp. 1–25, 2018.
M. Z. Hasan, R. Fink, M. R. Suyambu, and M. K. Baskaran, “Assessment and improvement of intelligent controllers for elevator energy efficiency,” in IEEE International Conference on Electro Information Technology, 2012. doi: 10.1109/EIT.2012.6220727.
T. Shaikh and D. Deshpande, “Feature Selection Methods in Sentiment Analysis and Sentiment Classification of Amazon Product Reviews,” Int. J. Comput. Trends Technol., 2016, doi: 10.14445/22312803/ijctt-v36p139.
V. V. Kumar, M. K. Pandey, M. K. Tiwari, and D. Ben-Arieh, “Simultaneous optimization of parts and operations sequences in SSMS: A chaos embedded Taguchi particle swarm optimization approach,” J. Intell. Manuf., 2010, doi: 10.1007/s10845-008-0175-4.
C. W. Park and D. R. Seo, “Sentiment analysis of Twitter corpus related to artificial intelligence assistants,” in 2018 5th International Conference on Industrial Engineering and Applications, ICIEA 2018, 2018. doi: 10.1109/IEA.2018.8387151.
W. Y. Chong, B. Selvaretnam, and L. K. Soon, “Natural Language Processing for Sentiment Analysis: An Exploratory Analysis on Tweets,” in Proceedings - 2014 4th International Conference on Artificial Intelligence with Applications in Engineering and Technology, ICAIET 2014, 2014. doi: 10.1109/ICAIET.2014.43.
K. Jain and S. Kaushal, “A Comparative Study of Machine Learning and Deep Learning Techniques for Sentiment Analysis,” in 2018 7th International Conference on Reliability, Infocom Technologies and Optimization: Trends and Future Directions, ICRITO 2018, 2018. doi: 10.1109/ICRITO.2018.8748793.
Y. Chen and Z. Zhang, “Research on text sentiment analysis based on CNNs and SVM,” in Proceedings of the 13th IEEE Conference on Industrial Electronics and Applications, ICIEA 2018, 2018. doi: 10.1109/ICIEA.2018.8398173.
A. Ilmania, Abdurrahman, S. Cahyawijaya, and A. Purwarianti, “Aspect Detection and Sentiment Classification Using Deep Neural Network for Indonesian Aspect-Based Sentiment Analysis,” in Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018, 2019. doi: 10.1109/IALP.2018.8629181.
J. Kaur and B. K. Sidhu, “Sentiment Analysis Based on Deep Learning Approaches,” in Proceedings of the 2nd International Conference on Intelligent Computing and Control Systems, ICICCS 2018, 2018. doi: 10.1109/ICCONS.2018.8662899.
Z. Hu, J. Hu, W. Ding, and X. Zheng, “Review Sentiment Analysis Based on Deep Learning,” in Proceedings - 12th IEEE International Conference on E-Business Engineering, ICEBE 2015, 2015. doi: 10.1109/ICEBE.2015.24.
M. Pota, M. Esposito, M. A. Palomino, and G. L. Masala, “A subword-based deep learning approach for sentiment analysis of political tweets,” in Proceedings - 32nd IEEE International Conference on Advanced Information Networking and Applications Workshops, WAINA 2018, 2018. doi: 10.1109/WAINA.2018.00162.
Z. Singla, S. Randhawa, and S. Jain, “Sentiment analysis of customer product reviews using machine learning,” in Proceedings of 2017 International Conference on Intelligent Computing and Control, I2C2 2017, 2018. doi: 10.1109/I2C2.2017.8321910.
C. P. B. . F. K. . K. B. Rain, “Sentiment Analysis in Amazon Reviews Using Probabilistic Machine Learning,” Compu- tational Linguist. Intell. Text Process., vol. 4, no. 1, pp. 475–487, 2013.
A. Bali, P. Agarwal, G. Poddar, D. Harsole, and N. M., “Consumer’s Sentiment Analysis of Popular Phone Brands and Operating System Preference,” Int. J. Comput. Appl., 2016, doi: 10.5120/ijca2016912288.
T. U. Haque, N. N. Saber, and F. M. Shah, “Sentiment analysis on large scale Amazon product reviews,” in 2018 IEEE International Conference on Innovative Research and Development, ICIRD 2018, 2018. doi: 10.1109/ICIRD.2018.8376299.
S. Wladislav, Z. Johannes, W. Christian, K. Andre, and F. Madjid, “Sentilyzer: Aspect-oriented sentiment analysis of product reviews,” in Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018, 2018. doi: 10.1109/CSCI46756.2018.00059.
Z. Singla, S. Randhawa, and S. Jain, “Statistical and sentiment analysis of consumer product reviews,” in 8th International Conference on Computing, Communications and Networking Technologies, ICCCNT 2017, 2017. doi: 10.1109/ICCCNT.2017.8203960.
V. Kolluri, “A Pioneering Approach to Forensic Insights: Utilization AI For Cybersecurity Incident Investigations,” IJRAR - Int. J. Res. Anal. Rev. (IJRAR), E-ISSN 2348-1269, pp. 2348–1269, 2016.
M. Z. Hasan, R. Fink, M. R. Suyambu, M. K. Baskaran, D. James, and J. Gamboa, “Performance evaluation of energy efficient intelligent elevator controllers,” in IEEE International Conference on Electro Information Technology, 2015. doi: 10.1109/EIT.2015.7293320.
V. V Kumar, “An interactive product development model in remanufacturing environment : a chaos-based artificial bee colony approach,” Spring, 2014, [Online]. Available: https://scholarsmine.mst.edu/cgi/viewcontent.cgi?article=8243&context=masters_theses
V. KOLLURI, “A COMPREHENSIVE ANALYSIS ON EXPLAINABLE AND ETHICAL MACHINE: DEMYSTIFYING ADVANCES IN ARTIFICIAL INTELLIGENCE,” no. August, 2015.
V. V. Kumar, F. W. Liou, S. N. Balakrishnan, and V. Kumar, “Economical impact of RFID implementation in remanufacturing: a Chaos-based Interactive Artificial Bee Colony approach,” J. Intell. Manuf., 2015, doi: 10.1007/s10845-013-0836-9.
Y. Al Amrani, M. Lazaar, and K. E. El Kadirp, “Random forest and support vector machine based hybrid approach to sentiment analysis,” in Procedia Computer Science, 2018. doi: 10.1016/j.procs.2018.01.150.
V. V. Kumar and F. T. S. Chan, “A superiority search and optimisation algorithm to solve RFID and an environmental factor embedded closed loop logistics model,” Int. J. Prod. Res., 2011, doi: 10.1080/00207543.2010.503201.
J. Wu and T. Ji, “2016_Deep Learning for Amazon Food Review Sentiment Analysis,” pp. 1–8, 2016.
Sarisa, M., Boddapati, V. N., Patra, G. K., Kuraku, C., Konkimalla, S., & Rajaram, S. K. (2020). An Effective Predicting E-Commerce Sales & Management System Based on Machine Learning Methods. Journal of Artificial Intelligence and Big Data, 1(1), 75–85. Retrieved from
Gollangi, H. K., Bauskar, S. R., Madhavaram, C. R., Galla, E. P., Sunkara, J. R., & Reddy, M. S. (2020). Exploring AI Algorithms for Cancer Classification and Prediction Using Electronic Health Records. Journal of Artificial Intelligence and Big Data, 1(1), 65–74.
Manikanth Sarisa, Venkata Nagesh Boddapati, Gagan Kumar Patra, Chandrababu Kuraku, Siddharth Konkimalla, Shravan Kumar Rajaram.Navigating the Complexities of Cyber Threats, Sentiment, and Health with AI/ML. (2020). JOURNAL OF RECENT TRENDS IN COMPUTER SCIENCE AND ENGINEERING (JRTCSE), 8(2), 22-40. https://doi.org/10.70589/JRTCSE.2020.2.3
Gollangi, H. K., Bauskar, S. R., Madhavaram, C. R., Galla, E. P., Sunkara, J. R., & Reddy, M. S. (2020).Unveiling the Hidden Patterns: AI-Driven Innovations in Image Processing and Acoustic Signal Detection. (2020). JOURNAL OF RECENT TRENDS IN COMPUTER SCIENCE AND ENGINEERING ( JRTCSE), 8(1), 25-45. https://doi.org/10.70589/JRTCSE.2020.1.3.
Hemanth Kumar Gollangi, Sanjay Ramdas Bauskar, Chandrakanth Rao Madhavaram, Eswar Prasad Galla, Janardhana Rao Sunkara and Mohit Surender Reddy (2020). “Echoes in Pixels: The intersection of Image Processing and Sound detection through the lens of AI and Ml”, International Journal of Development Research. 10, (08),39735-39743. https://doi.org/10.37118/ijdr.28839.28.2020.
Manikanth Sarisa, Venkata Nagesh Boddapati, Gagan Kumar Patra, Chandrababu Kuraku, Siddharth Konkimalla and Shravan Kumar Rajaram. “The power of sentiment: big data analytics meets machine learning for emotional insights”, International Journal of Development Research, 10, (10), 41565-41573. https://doi.org/10.37118/ijdr.28840.10.2020
Mohit Surender Reddy, Manikanth Sarisa, Siddharth Konkimalla, Sanjay Ramdas Bauskar, Hemanth Kumar Gollangi, Eswar Prasad Galla, Shravan Kumar Rajaram, 2021. "Predicting tomorrow’s Ailments: How AI/ML Is Transforming Disease Forecasting", ESP Journal of Engineering & Technology Advancements, 1(2): 188-200.
K. Gollangi, S. R. Bauskar, C. R. Madhavaram, P. Galla, J. R. Sunkara, and M. S. Reddy, “ECHOES IN PIXELS: THE INTERSECTION OF IMAGE PROCESSING AND SOUND OPEN ACCESS ECHOES IN PIXELS: THE INTERSECTION OF IMAGE PROCESSING AND SOUND DETECTION,” Int. J. Dev. Res., vol. 10, no. 08, pp. 39735–39743, 2020, doi: 10.37118/ijdr.28839.28.2020.
Gollangi, H. K., Bauskar, S. R., Madhavaram, C. R., Galla, E. P., Sunkara, J. R., & Reddy, M. S. (2020). Unveiling the Hidden Patterns: AI-Driven Innovations in Image Processing and Acoustic Signal Detection. (2020). JOURNAL OF RECENT TRENDS IN COMPUTER SCIENCE AND ENGINEERING (JRTCSE), 8(1), 25- 45. https://doi.org/10.70589/JRTCSE.2020.1.3.
Gollangi, H. K., Bauskar, S. R., Madhavaram, C. R., Galla, E. P., Sunkara, J. R., & Reddy, M. S. (2020). Exploring AI Algorithms for Cancer Classification and Prediction Using Electronic Health Records. Journal of Artificial Intelligence and Big Data, 1(1), 65–74. Retrieved from https://www.scipublications.com/journal/index.php/jaibd/article/view/1109
Gagan Kumar Patra, Shravan Kumar Rajaram, & Venkata Nagesh Boddapati. (2019). Ai And Big Data in Digital Payments: A Comprehensive Model For Secure Biometric Authentication. Educational Administration: Theory and Practice, 25(4), 773–781. https://doi.org/10.53555/kuey.v25i4.7591
Chandrababu Kuraku, Hemanth Kumar Gollangi, & Janardhana Rao Sunkara. (2020). Biometric Authentication In Digital Payments: Utilizing AI And Big Data For Real-Time Security And Efficiency. Educational Administration: Theory and Practice, 26(4), 954–964. https://doi.org/10.53555/kuey.v26i4.7590
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 Niharika Katnapally, Purna Chandra Rao Chinta, Kishan Kumar Routhu, Vasu Velaga, Varun Bodepudi, Laxmana Murthy Karaka

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution (CC-BY) 4.0 License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.