This is for my reference and this might come in handy for others too. All Tesseract options $ tesseract –help-extra CLI Examples Command Example Notes…

In this article, I will explain a widely used method for segmenting handwritten documents into individual lines. Below is a sample output from my algorithm.…

An important pre-processing step in any OCR tool or algorithm is to deskew the scanned document first. Take a look at the below sample scanned…

Principal component analysis is a method used to reduce the number of dimensions in a dataset without losing much information. It’s used in many fields…

Read my previous post to understand how K-Means algorithm works. In this post I will try to run the K-Means on Iris dataset to classify…

K-Means is one of the simplest unsupervised clustering algorithm which is used to cluster our data into K number of clusters. The algorithm iteratively assigns…

Support Vector machines (SVM) can be used for both classification as well as regression tasks but they are mostly used in classification applications. Some of…

Polynomial regression is a process of finding a polynomial function that takes the form f( x ) = c0 + c1 x + c2 x2 ⋯ cn xn where n is the degree of the polynomial and c is a set of coefficients. Through…

Previously we derived a simple linear regression modal for our Pizza price dataset. We built a modal that predicted a price of $13.68 for a 12 inch…

Simple linear regression is a statistical method you can use to study relationships between two continuous (quantitative) variables: independent variable (x) – also referred to as predictor…