Linear regression is a basic and commonly used type of predictive analysis. Linear Regression is a method to help us understand the relationship between two (Simple Linear Regression) or more variables (Multiple Linear Regression).
In this article, I am going to explore the data from Iowa Retail Liquor Sales which is available to access on Google BigQuery public datasets. BigQuery is a fully-managed, serverless data warehouse that enables scalable analysis over petabytes of data. It is a Platform as a Service that supports querying using ANSI SQL.
Data Science is the process and method for extracting knowledge and insights from large volumes of disparate data. It’s an interdisciplinary field involving mathematics, statistical analysis, data visualization, machine learning, and more.
Clustering is an unsupervised Machine Learning model algorithm used to group similar data points together and discover underlying patterns. Among other clustering models, k-means clustering is the most popular and easy to use.
Cluster is a group of objects that are similar to other objects in the cluster, and dissimilar to data points in other clusters. Clustering algorithms are divided into three parts:
- Partition-based Clustering. (e.g. K-Means, K-Median…
Data Analyst/Scientist | Github: jonando93