Definition from Wikipedia

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a…


ChatBots

An AI-based software that helps the customers quick and faster response by mimicking a human's behavior.

Benefits

  1. Cost and Time Effective
  2. Cheaper to maintenance
  3. Degree of automation for mundane tasks
  4. Increase productivity especially in Customer Support

Chatbot evolution

1st Generation — Traditional Bot

  • System driven
  • automation Scripts
  • Minimum functionality…

Applications of Regular expressions

Extracting specific text like Timestamp from logs generated like in java Log4j framework.

Basic validations on input fields in websites for client-side validations like valid email id formats, password requirements, etc.

Filtering invalid phone Numbers in Pandas DataFrame to remove incorrect phone numbers provided based on Country code, digits, etc.


Clustering

  • Is the process of dividing the dataset into groups consisting of similar data points. data in the same group are as similar as possible and dissimilar to other groups
  • used in Recommendation Engine.

Types:

  1. Exclusive Clustering- Each data point can lie in only one cluster. Ex: K-Means clustering
  2. Overlapping…

Classification is the process of grouping things according to similar features they share.

Types of Classification Algorithms
1. Logistic Regression
2. Decision Tree
3. Random Forest
4. K Nearest Neighbour
5. Naive byes
6. SVM
7. XGBOOST
  1. Logistic Regression

is used when the dependent variable(target) is categorical.

More details in…


Regression is a technique that displays the relationship between variable y based on values of x

ex: y- Inches of rain varies according to x- New Cars sold

  • If you think there is a relationship between two things, the regression would help to confirm it.

Main Types:

Linear Regression —…


Creating an algorithm that a computer then uses to find a model that fits the data as best as possible. And makes a very accurate prediction based on that

ML is different from traditional statistical methods because we do not give Instruction to find a Model, rather give the algorithm…


  • Data Visualization is the representation of data graphically or pictorially. Allows high-level Representatives to see analytics, grasp difficult concepts and identify new patterns at ease.

Ex: Kmeans clustering, Its easier to understand the cluster labeling done by an algorithm of data.Not possible to label clusters just by checking raw data.

azam sayeed

.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store