What is transfer learning and how is it used to pre-train models for new tasks?

Transfer learning is an approach where a pre-trained model is applied to a new, related task. This method helps save time and resources while enhancing the performance of machine learning models. There are various strategies for adapting pre-trained models to fit a new task.

What is transfer learning?

Transfer learning is a machine learning technique in which an already trained model is adapted to a new, similar task. Rather than training a new model from scratch, the existing knowledge is reused: through minor adjustments, the pre-trained model is adapted to the new task so that it can handle different features. This approach saves time and resources, since much smaller datasets are needed for training, and it often improves performance as well.

How does transfer learning work?

Transfer learning involves taking a model that has already been fully trained for a specific task and applying it to a new, similar task. This method is especially effective when working with unstructured data, such as images or videos. For example, a model trained to recognize images of cars can be adapted to identify trucks, as many features, such as wheels, doors, and overall shape, are shared between the two categories.

Selecting a trained model

As a starting point, you need a pre-trained model, which is created by training on a large dataset of labeled examples. The model learns to recognize patterns and relationships in the data, allowing it to perform its original task. In deep learning, this processing takes place in interconnected layers of the network, and the more layers a model has, the more complex the patterns it can recognize.

In transfer learning, you choose a model that has already successfully completed this process. It’s important to closely examine the source task of the existing model. The more similar it is to the new task, the easier it will be to adapt the model for the new application.
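
To make this concrete, here is a minimal sketch in Python using PyTorch and torchvision (a common toolchain for this, though the article does not prescribe one). It loads a ResNet-18 that was pre-trained on ImageNet as the starting point:

    import torchvision.models as models

    # Load a ResNet-18 pre-trained on ImageNet; its layers already encode
    # general visual features such as edges, textures, and shapes.
    model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)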

Reconfiguring and training the model

The second step is to configure the pre-trained model for the new task. There are generally two practical methods for this, and you can choose the one that best suits your needs.

In the first method, the last layer of the pre-trained model is replaced. This layer, also called the output layer, acts as the final classification unit, determining which of the trained classes a given input belongs to. For example, this layer might decide whether an image shows a car. In many cases, you can remove it and replace it with a new layer tailored to your specific application; in our example, the new layer would be designed to identify trucks instead of cars.
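
Continuing the sketch from above, swapping out the output layer could look like this; the two-class truck example is an assumption made purely for illustration:

    import torch.nn as nn
    import torchvision.models as models

    model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

    # Replace the final classification layer (model.fc in ResNet) with a
    # new, untrained one sized for the new task: truck vs. not a truck.
    num_features = model.fc.in_features
    model.fc = nn.Linear(num_features, 2)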

Alternatively, with transfer learning, it's possible to freeze the existing parameters and add new layers instead. These new layers are specifically designed for the new task and are integrated into the model. The adapted model is then trained on a much smaller dataset containing the relevant examples. During this training, the model learns the patterns and relationships of the new task while leveraging the knowledge gained from the original training.
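
A minimal sketch of this alternative, again assuming PyTorch and the ResNet-18 from above: all pre-trained parameters are frozen, and a small stack of new layers replaces the head, so only the new layers are updated during training:

    import torch.nn as nn
    import torchvision.models as models

    model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

    # Freeze all pre-trained parameters to preserve the original knowledge.
    for param in model.parameters():
        param.requires_grad = False

    # Add new trainable layers; only these receive gradient updates during
    # training on the smaller, task-specific dataset.
    model.fc = nn.Sequential(
        nn.Linear(model.fc.in_features, 256),
        nn.ReLU(),
        nn.Linear(256, 2),  # assumed two-class example
    )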

Checking progress

The final step is always necessary: only through careful monitoring, and adjustments to the training data and the new layers where needed, can the model learn the new task. As the parameters are refined during training, accuracy increases and the model learns to meet the new requirements.
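
One way to carry out this monitoring, sketched in PyTorch: measure the accuracy on a held-out validation set after each training epoch and adjust the training data or new layers if it stops improving. The val_loader and its batch format are assumptions, not something the article specifies:

    import torch

    def evaluate(model, val_loader, device="cpu"):
        # val_loader is assumed to be a torch.utils.data.DataLoader that
        # yields (images, labels) batches from a held-out validation set.
        model.eval()
        correct = total = 0
        with torch.no_grad():
            for images, labels in val_loader:
                images, labels = images.to(device), labels.to(device)
                predictions = model(images).argmax(dim=1)
                correct += (predictions == labels).sum().item()
                total += labels.size(0)
        return correct / total  # fraction of correctly classified examples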

What are the different strategies?

There are different strategies for using transfer learning. Which one is right for you depends primarily on your intended purpose. Some common approaches include:

  • Feature extraction: With feature extraction, you use the pre-trained model to extract general features, such as textures, while the new layers are designed to recognize more specific ones. This approach is useful when the source and target tasks overlap significantly (see the sketch after this list).
  • Inductive transfer learning: In this case, the source and target domains are the same, but the tasks differ. This method allows new functions to be trained more quickly, as the model can leverage knowledge from the source task to improve learning in the target task.
  • Transductive transfer learning: In this strategy, the knowledge gained from the source task is transferred directly to specific instances of the new task, for example to classify them more accurately. This approach is promising when the source and target tasks share comparatively few similarities.
  • Unsupervised transfer learning: Here, the source and target domains are similar but the tasks differ, and no labeled data is provided. Instead, the model learns the differences and similarities within the unlabeled data, enabling it to generalize and make predictions based on this information.
  • Multitasking: In this approach, a model performs several related, though not identical, tasks at the same time, enabling them to share knowledge.
  • Prediction: In this form of transfer learning, the model fills in missing parts of the data itself, for example by predicting words within a sentence. The results are then improved through fine-tuning.
  • Zero-shot and few-shot: These are forms of transfer learning in the field of generative AI in which knowledge from a source is transferred to a target even when the two share only a few overlaps (few-shot) or none at all (zero-shot). The method is used when very little training data is available.
  • Disentanglement: For this approach, data is split into different factors. The model can then consider and manipulate style and content separately, for example.
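
As an illustration of the feature-extraction strategy mentioned above, here is a minimal PyTorch sketch: the classification head is stripped entirely, and the frozen network produces feature vectors that could then feed a new, task-specific classifier:

    import torch
    import torch.nn as nn
    import torchvision.models as models

    # Use the pre-trained network purely as a fixed feature extractor.
    backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
    backbone.fc = nn.Identity()  # ResNet-18 now outputs 512-dim feature vectors
    backbone.eval()

    with torch.no_grad():
        images = torch.randn(4, 3, 224, 224)  # stand-in for a real image batch
        features = backbone(images)           # shape: (4, 512)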

What areas of application does transfer learning have?

There are numerous potential areas of application for transfer learning. The method offers significant savings in cost, time, and resources, making it highly advantageous. The most important applications to date include:

  • Image recognition
  • Speech recognition
  • Object localization
  • Diagnostics in the healthcare sector

In the future, however, transfer learning is likely to be applied in many other areas as well.
