Application of Data Science to the Analysis of TV Show and Movie Data on a Streaming Service Application
Abstract
Data science can be applied to analyze and explore data at large scale. This study analyzes data on the TV shows and movies available on Netflix. The dataset was obtained from Kaggle.com, where it was uploaded by Shivam Bansal, and contains information about each show, such as title, director, cast, country of origin, duration, genre, and description. The study covers data from 2016 to 2020 and applies the stages of Exploratory Data Analysis, from problem definition through to results, which are presented as data visualizations to help summarize the findings. The analysis also generates show recommendations using the Cosine Similarity algorithm, applies clustering with K-Means, and performs N-gram analysis on the clustering results.
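As an illustration of the recommendation step, the following is a minimal sketch of cosine similarity over word-count vectors built from show descriptions. The abstract does not state which features or libraries the study used, so the bag-of-words representation, the function names `cosine_similarity` and `recommend`, and the three-entry `catalog` are all hypothetical and chosen only to keep the example self-contained.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty description is similar to nothing
    return dot / (norm_a * norm_b)


def recommend(title: str, catalog: dict, top_n: int = 2) -> list:
    """Rank the other titles by similarity of their descriptions to `title`."""
    # Assumption: whitespace-tokenized, lowercased word counts stand in for
    # whatever text features the study actually used.
    vectors = {t: Counter(text.lower().split()) for t, text in catalog.items()}
    query = vectors[title]
    scores = [
        (other, cosine_similarity(query, vec))
        for other, vec in vectors.items()
        if other != title
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]


# Hypothetical toy catalog, not taken from the Netflix dataset.
catalog = {
    "Show A": "crime drama detective city",
    "Show B": "crime thriller detective murder",
    "Show C": "romantic comedy wedding city",
}

for other, score in recommend("Show A", catalog):
    print(f"{other}: {score:.2f}")
```

In this toy catalog, "Show B" ranks above "Show C" for "Show A" because it shares two description words with it rather than one. A real pipeline would more likely use weighted text features such as TF-IDF, possibly combined with genre, cast, and director fields.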