1. Introduction to Machine Learning

Lecturer: Dr. Fangli Ying (https://fangli-ying.github.io/)

2023.03.01 Class: Introduction to Engineering

An Overview

Machine learning is a significant area of study in the modern world, gaining popularity each day as an emerging trend that showcases the advances made in recent decades. Alongside Artificial Intelligence and Data Science, it contributes to many of today's developments and technologies. Many industries and products rely on machine learning; web browsers offer features like autocorrect, and YouTube offers recommendation systems. Machine learning excels in numerous fields and has applications in almost every major industry. Before diving into mastering machine learning, it is worth understanding how experts have defined it. – “This paragraph is generated from ChatGPT”

The first definition

“The field of study that gives computers the ability to learn without being explicitly programmed” (Arthur Samuel, 1959).

A second or modern interpretation of machine learning can be viewed as follows:

The second definition

“A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.” (Tom Mitchell).

Machine learning has many definitions, but the two theoretical explanations above should give most beginners an intuitive understanding of what we can expect from machine learning methodologies.

Machine Learning vs. Artificial Intelligence

Artificial Intelligence (AI) refers to all tasks in which a computer can make decisions by mimicking human, evolutionary, genetic, or physical processes. It includes tasks such as driving a car, finding a route, diagnosing a patient, or recommending a movie. Machine learning (ML) is a part of artificial intelligence that focuses on tasks where a computer can make decisions based on data, without being explicitly programmed. While the definitions of artificial intelligence and machine learning are often confused, machine learning is a subset of artificial intelligence. Deep learning (DL) is a subset of ML that uses artificial neural networks to process large amounts of data and solve complex problems. DL is inspired by the structure and function of the human brain, and it has been successful in areas such as computer vision, natural language processing, and speech recognition.

In summary, AI is the broader field of creating intelligent systems, ML is a subset of AI that enables machines to learn and improve on their own through experience, and DL is a subset of ML that uses artificial neural networks to process large amounts of data and solve complex problems.

ML VS AI

Once I saw an article titled “Will neural networks replace machine learning?” on some hipster media website. The general rule is to compare things on the same level. That’s why this phrase sounds like “will the wheels replace cars?”

How Machine Learning Works

Machine learning is the set of all tasks in which a computer can make decisions based on data. What does this mean? Let’s go back to looking at how humans make decisions. In general terms, we make decisions in the following two ways:

  • By using logic and reasoning

  • By using our experience

For example, imagine that we are trying to decide what car to buy. We can look carefully at the features of the car, such as price, fuel consumption, and navigation, and try to figure out the best combination of them that fits our budget. That is using logic and reasoning. On the other hand, if we ask all of our friends what cars they own and what they like and dislike about them, form a list of that information, and use it to decide, then we are using experience (in this case, our friends’ experiences).

people-computer

Machine learning represents the second method: making decisions using experience. In computer lingo, the term for experience is data. Therefore, in machine learning, computers make decisions based on data. Thus, any time we get a computer to solve a problem or make a decision using only data, we are doing machine learning.

In short, machine learning is a set of techniques for giving machines the ability to find patterns and extract rules from data, in order to:

  • Identify or classify elements

  • Detect tendencies

  • Make predictions

As more data is fed into the system, results get better: performance improves with experience.

programming_paradigm

training_paradigm
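To make the contrast between the two paradigms concrete, here is a minimal sketch in Python. The spam-filter scenario and the numbers are invented for illustration, and scikit-learn is used only as a convenient example library; the lecture does not prescribe any particular tool.

```python
from sklearn.linear_model import LogisticRegression

# Traditional programming: a human writes the rule explicitly.
def is_spam_rule(num_links: int) -> bool:
    return num_links > 3                    # hard-coded rule

# Machine learning: the rule is extracted from examples (data + answers).
X = [[0], [1], [2], [5], [7], [9]]          # feature: number of links in an email
y = [0, 0, 0, 1, 1, 1]                      # label: 0 = legitimate, 1 = spam
model = LogisticRegression().fit(X, y)      # "training" replaces hand-written rules

print(is_spam_rule(6))                      # rule-based decision
print(model.predict([[6]]))                 # data-driven decision learned from the examples
```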

Basic Terminologies

The greater the variety in the samples you have, the easier it is to find relevant patterns and predict the result. We know that in machine learning we get the computer to learn how to solve a problem using data, and the way the computer solves the problem is by using the data to build a model. Besides data, several other components go into building a machine learning model. To make this clear, let us look at the process by which a machine learning model is created.

process

In the figure above, the first step in creating a model is to feed training data to the machine learning algorithm. An algorithm is a procedure, or a set of steps, used to solve a problem or perform a computation. A machine learning algorithm defines the method by which it learns (trains) from the data; under the hood, it is represented as a mathematical formula. The training process results in a machine learning model. A machine learning model is a program, or set of rules, created after the machine learning algorithm learns from data, and it can be used to make predictions; under the hood, it is represented as a mathematical function.
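As a rough sketch of this algorithm-to-model pipeline, the snippet below uses scikit-learn and invented toy data (both are assumptions for illustration): training data is fed to a learning algorithm, and the resulting model is used to make a prediction.

```python
from sklearn.tree import DecisionTreeClassifier

# Invented training data: features (inputs) and the answers the model should learn.
X_train = [[4.0, 6.0], [30.0, 10.0], [3.5, 5.5], [25.0, 9.0]]   # [weight_kg, ear_length_cm]
y_train = ["cat", "dog", "cat", "dog"]

algorithm = DecisionTreeClassifier()        # the learning procedure (a set of steps)
model = algorithm.fit(X_train, y_train)     # training the algorithm on data produces the model

print(model.predict([[5.0, 6.5]]))          # the model labels new, unseen data
```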

model

We define features as the properties or characteristics of the data. If our data is in a table, the features are the columns of the table. Features could even be the colors of the pixels in a certain image. This is what describes our data. Some features are special, though, and we call them labels. In short, any property or characteristic of the data that the model can use to make predictions is called a feature.

feature
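A small, hypothetical table makes the feature/label distinction concrete. The pet data below is invented, and pandas is assumed only because it is a common way to hold tabular data.

```python
import pandas as pd

# An invented pet table: every column is a feature; "species" is the special one we treat as the label.
pets = pd.DataFrame({
    "weight_kg":     [4.2, 30.0, 3.8, 25.5],
    "ear_length_cm": [6.0, 10.0, 5.5, 9.0],
    "species":       ["cat", "dog", "cat", "dog"],
})

features = pets[["weight_kg", "ear_length_cm"]]   # what the model uses to make predictions
labels   = pets["species"]                        # what the model tries to predict
print(features.shape, labels.shape)               # (4, 2) (4,)
```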

Learning in machine learning is purely mathematical, and it ends by associating certain inputs with certain outputs. It has nothing to do with understanding what the algorithm has learned. (When humans analyze data, we build an understanding of the data to a certain extent.) The learning process is often described as training because the algorithm is trained to match the correct answer (the output) to every question offered (the input).

Starts with data

Machine learning starts with data — numbers, photos, or text, like bank transactions, pictures of people or even bakery items, repair records, time series data from sensors, or sales reports. The data is gathered and prepared to be used as training data, or the information the machine learning model will be trained on. The more data, the better the program.

Reasons for success of machine learning:

  • Explosion of available data.

  • Huge progress in computing power.

  • Refinement of many existing algorithms.

  • Availability of sophisticated tools for building ML-powered systems.

data

A predictive machine learning model aims to predict the labels of the data; a label is the guess the model makes. There are two types of data: labeled and unlabeled. Labels depend on the problem context, such as predicting a pet’s type, health, or age. Labeled data has a tag or label, while unlabeled data has no tag. Examples of unlabeled data include photos, audio recordings, videos, news articles, tweets, and x-rays. Labeled data takes unlabeled data and adds meaningful tags or labels, such as identifying a cat or dog in a photo, or detecting a tumor in an x-ray. The two branches of machine learning are supervised learning (using labeled data) and unsupervised learning (using unlabeled data).

label

Types of machine learning

As we’ve learned before, machine learning is common sense for a computer. Machine learning roughly mimics the process by which humans make decisions based on experience, by making decisions based on previous data. Naturally, programming computers to mimic the human thinking process is challenging, because computers are engineered to store and process numbers, not make decisions.

This is the task that machine learning aims to tackle. Machine learning is divided into several branches, depending on the type of decision to be made. Machine learning essentially falls into three categories according to the way they operate: supervised, unsupervised, and reinforcement learning. The appropriate category is determined by the type of data at hand, and depends largely on whether it is labeled or unlabeled.

supervised

  • Supervised Learning: expected results (called labels or tags) are given to the system along with training data.

  • Unsupervised Learning: training data comes without the expected results. The system must discover some structure in the data by itself.

  • Reinforcement Learning: without being given an explicit goal, the system’s decisions produce a reward it tries to maximize.

Supervised Learning

data

A supervised learning model predicts the label of a new data point. In this case, the data points correspond to shapes (rectangle, circle, triangle, hexagon), and the supervised learning algorithm is trained to predict that a given data point does indeed correspond to, say, a triangle or a circle.

Now, notice that in the figure, we have two types of labeled datasets. In the dataset in the middle, each data point is labeled with the weight of the animal. In this dataset, the labels are numbers. In the dataset on the left, each data point is labeled with the type of animal (dog or cat). In this dataset, the labels are states. Numbers and states are the two types of data that we’ll encounter in supervised learning models. We call the first type numerical data and the second type categorical data.

Numerical data is any type of data that uses numbers, such as 4, 2.35, or –199 (continuous values that can fall anywhere in a range). Examples of numerical data are prices, sizes, or weights.

Categorical data is any type of data that uses categories, or states, such as male/female or cat/dog/bird. For this type of data, we have a finite set of categories to associate to each of the data points.

CR

This gives rise to the following two types of supervised learning models: Regression models are the types of models that predict numerical data. The output of a regression model is a number, such as the temperature in a weather forecast. Classification models are the types of models that predict categorical data. The output of a classification model is a category, or a state, such as the condition of the weather (hot or cold).
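The distinction can be sketched in a few illustrative lines: the same features paired with a numerical label call for a regression model, while a categorical label calls for a classification model. The toy numbers and the use of scikit-learn are assumptions, not part of the lecture.

```python
from sklearn.linear_model import LinearRegression, LogisticRegression

X = [[1], [2], [3], [4]]                    # one invented feature, e.g. an animal's age in years

y_numeric  = [4.0, 7.5, 11.0, 14.5]         # numerical label (e.g. weight) -> regression
y_category = ["cat", "cat", "dog", "dog"]   # categorical label -> classification

print(LinearRegression().fit(X, y_numeric).predict([[5]]))     # output is a number
print(LogisticRegression().fit(X, y_category).predict([[5]]))  # output is a category
```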

Classification Models

Classification algorithms find functions that divide the dataset into classes based on various parameters. When using a classification algorithm, a computer program is trained on the training dataset and then categorizes new data into classes depending on what it learned.

Classification algorithms find the mapping function that maps the input “x” to a discrete output “y.” The algorithms estimate discrete values (for example, binary values such as 0 and 1, yes and no, or true and false) based on a particular set of independent variables. Put another way, many classification algorithms predict the probability of an event occurring, for example by fitting the data to a logit function as logistic regression does.

The system predicts discrete values: the input is categorized.

Classification example

Classification example

Input: Gender, Age, Salary

Output: Purchased, i.e. 0 or 1; 1 means the customer will purchase it and 0 means the customer won’t.

For example, in the figures above, the output Purchased has predefined labels, i.e. 0 or 1. The goal here is to predict discrete values belonging to a particular class and to evaluate the model on the basis of accuracy.
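A hedged sketch of this purchase-prediction setup is shown below. The numbers are invented and logistic regression is just one possible choice of classifier; a scaling step is added because the salary feature is on a much larger scale than the others.

```python
import pandas as pd
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

# Invented training data matching the example: gender, age, salary -> purchased (0/1).
data = pd.DataFrame({
    "gender": [0, 1, 0, 1, 1, 0],            # encoded: 0 = male, 1 = female
    "age":    [22, 35, 47, 52, 29, 41],
    "salary": [25_000, 58_000, 82_000, 90_000, 32_000, 67_000],
    "purchased": [0, 0, 1, 1, 0, 1],          # the discrete label to predict
})

X = data[["gender", "age", "salary"]]
y = data["purchased"]

# Scale the features, then fit a logistic-regression classifier.
clf = make_pipeline(StandardScaler(), LogisticRegression()).fit(X, y)

# Predict for a new customer (same column order as the training features).
new_customer = pd.DataFrame([[1, 38, 60_000]], columns=X.columns)
print(clf.predict(new_customer))              # prints 0 or 1
```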

Classification types

  • Binary: only two possible classes. Examples: cat/not a cat, spam/legit mail, benign/malignant tumor.

  • Multiclass: several mutually exclusive classes. Example: handwritten digit recognition (see the sketch after this list).

  • Multilabel: several non-mutually exclusive classes. Example: face recognition.
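As a small illustration of the multiclass case, the sketch below trains a classifier on scikit-learn's bundled handwritten-digits dataset (ten mutually exclusive classes, 0 through 9). The choice of k-nearest neighbors here is arbitrary; any classifier would fit the same interface.

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Each sample is an 8x8 grayscale digit image flattened into 64 features; labels are 0-9.
digits = load_digits()
X_train, X_test, y_train, y_test = train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0)

clf = KNeighborsClassifier(n_neighbors=3).fit(X_train, y_train)
print(accuracy_score(y_test, clf.predict(X_test)))   # held-out accuracy across the ten classes
```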

Classification algorithms

The types of Classification algorithms typically used in Machine Learning:

• Decision Tree Classification: This type divides a dataset into segments based on particular feature variables. The divisions’ threshold values are typically the mean or mode of the feature variable in question if they happen to be numerical.

• K-Nearest Neighbors: This classification type identifies the K nearest neighbors to a given observation point. It then uses those K points to evaluate the proportion of each class of the target variable and predicts the class with the highest proportion.

• Logistic Regression: This classification type isn’t complex so it can be easily adopted with minimal training. It predicts the probability of Y being associated with the X input variable.

• Naïve Bayes: This classifier is one of the most effective yet simplest algorithms. It’s based on Bayes’ theorem, which describes how event probability is evaluated based on the previous knowledge of conditions that could be related to the event.

• Random Forest Classification: Random forest processes many decision trees, each one predicting a value for target variable probability. You then arrive at the final output by averaging the probabilities.

• Support Vector Machines: This algorithm employs support vector classifiers and is well suited to evaluating non-linear decision boundaries. This is made possible by enlarging the feature variable space using special functions known as kernels.
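The sketch below tries several of the classifiers listed above on one synthetic dataset, mainly to show that they all share the same fit/score interface in scikit-learn (an assumed library). The dataset is generated on the fly, so the printed scores are illustrative and do not rank the algorithms in general.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

# Synthetic binary-classification data, split into training and test sets.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

classifiers = {
    "Decision tree": DecisionTreeClassifier(random_state=0),
    "K-nearest neighbors": KNeighborsClassifier(),
    "Logistic regression": LogisticRegression(max_iter=1000),
    "Naive Bayes": GaussianNB(),
    "Random forest": RandomForestClassifier(random_state=0),
    "Support vector machine": SVC(kernel="rbf"),
}

# Every classifier exposes the same fit/score interface.
for name, clf in classifiers.items():
    accuracy = clf.fit(X_train, y_train).score(X_test, y_test)
    print(f"{name:24s} accuracy = {accuracy:.2f}")
```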

Regression Models

Regression finds correlations between dependent and independent variables. Therefore, regression algorithms help predict continuous variables such as house prices, market trends, weather patterns, oil and gas prices (a critical task these days!), etc.

The Regression algorithm’s task is finding the mapping function so we can map the input variable of “x” to the continuous output variable of “y.”

The system predicts continuous values. Examples: temperature forecasting, asset price prediction…

Regression example

Input: GPA,…

Output: Happiness Scores

For example, in the figure above, the output (Happiness) does not take discrete values but is continuous within a particular range. The goal here is to predict a value as close to the actual output value as our model can, and evaluation is then done by calculating the error. The smaller the error, the greater the accuracy of our regression model.
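A minimal sketch of this GPA-to-happiness example is given below, with invented numbers and a simple linear regression. The point is only that the output is a continuous value and the model is judged by its error, not by accuracy.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error

# Invented data: GPA as the input feature, a continuous happiness score as the output.
gpa = np.array([[2.0], [2.5], [3.0], [3.5], [4.0]])
happiness = np.array([4.5, 5.2, 6.1, 7.0, 7.8])

reg = LinearRegression().fit(gpa, happiness)
predictions = reg.predict(gpa)

# Regression models are evaluated with an error measure rather than accuracy.
print("mean absolute error:", mean_absolute_error(happiness, predictions))
print("predicted happiness for GPA 3.2:", reg.predict([[3.2]]))
```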

Regression algorithms

Here are the types of Regression algorithms commonly found in the Machine Learning field:

• Decision Tree Regression: The primary purpose of this regression is to divide the dataset into smaller subsets. These subsets are created to plot the value of any data point connecting to the problem statement.

• Principal Components Regression: This regression technique is widely used when there are many independent variables or multicollinearity exists in your data.

• Polynomial Regression: This type fits a non-linear equation by using the polynomial functions of an independent variable.

• Random Forest Regression: Random Forest regression is heavily used in Machine Learning. It uses multiple decision trees to predict the output. Random data points are chosen from the given dataset and used to build a decision tree via this algorithm.

• Simple Linear Regression: This type is the least complicated form of regression, where the dependent variable is continuous.

• Support Vector Regression: This regression type solves both linear and non-linear models. It uses non-linear kernel functions, like polynomials, to find an optimal solution for non-linear models.
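As with classification, several of the regressors listed above can be tried on one dataset through the same fit/score interface. The sketch below uses scikit-learn and generated data (both assumptions), so the scores are illustrative only.

```python
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import RandomForestRegressor

# Synthetic regression data with a little noise, split into training and test sets.
X, y = make_regression(n_samples=400, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

regressors = {
    "Simple linear regression": LinearRegression(),
    "Polynomial regression": make_pipeline(PolynomialFeatures(degree=2), LinearRegression()),
    "Decision tree regression": DecisionTreeRegressor(random_state=0),
    "Random forest regression": RandomForestRegressor(random_state=0),
}

# Regressors are scored here with R^2 on held-out data (closer to 1 is better).
for name, reg in regressors.items():
    r2 = reg.fit(X_train, y_train).score(X_test, y_test)
    print(f"{name:26s} R^2 = {r2:.2f}")
```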

Unsupervised Learning

In unsupervised learning, the training data comes without expected results: the system must discover structure in the data by itself, for example by grouping similar data points (clustering) or by reducing the dimensionality of the data.

- Blog: [More difficult examples](https://github.com/ageron/handson-ml2/blob/master/09_unsupervised_learning.ipynb)
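As a minimal taste of unsupervised learning, the sketch below clusters unlabeled points with k-means. The data is generated on the fly, and k-means is only one of many unsupervised techniques; scikit-learn is again an assumed library choice.

```python
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans

# Unlabeled data: 300 generated points, no expected results provided.
X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

# k-means discovers 3 groups purely from the structure of the data.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
cluster_ids = kmeans.fit_predict(X)

print(cluster_ids[:10])          # cluster assignment (0, 1, or 2) for the first points
print(kmeans.cluster_centers_)   # coordinates of the discovered group centers
```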
  File "<ipython-input-1-cbbb3de49c8d>", line 1
    - Blog: [More difficult examples](https://github.com/ageron/handson-ml2/blob/master/09_unsupervised_learning.ipynb)
                          ^
SyntaxError: invalid syntax

Summary

map

Here we have seen a few simple examples of some of the basic types of machine learning approaches. Needless to say, there are a number of important practical details that we have glossed over, but I hope this section was enough to give you a basic idea of what types of problems machine learning approaches can solve.

In short, we saw the following:

  • Supervised learning: Models that can predict labels based on labeled training data

    • Classification: Models that predict labels as two or more discrete categories

    • Regression: Models that predict continuous labels

  • Unsupervised learning: Models that identify structure in unlabeled data

    • Clustering: Models that detect and identify distinct groups in the data

    • Dimensionality reduction: Models that detect and identify lower-dimensional structure in higher-dimensional data

In the following sections we will go into much greater depth within these categories, and see some more interesting examples of where these concepts can be useful.

All of the figures in the preceding discussion are generated from actual machine learning computations; the code behind them can be found in my GitHub repository.