Suchen und Finden

Service

Multi-Modal Machine Learning. An Introduction to BERT Pre-Trained Visio-Linguistic Models

Multi-Modal Machine Learning. An Introduction to BERT Pre-Trained Visio-Linguistic Models

Johanna Garthe

Verlag GRIN Verlag , 2023

ISBN 9783346983749 , 22 Seiten

Format PDF

Kopierschutz frei

Geräte

Mehr zum Inhalt

Multi-Modal Machine Learning. An Introduction to BERT Pre-Trained Visio-Linguistic Models

Kapitelkauf
Kurzinformation
Inhaltsverzeichnis
Leseprobe
Blick ins Buch
Fragen zum eBook

Seminar paper from the year 2021 in the subject Computer Sciences - Computational linguistics, grade: 1,3, University of Trier (Computerlinguistik und Digital Humanities), course: Mathematische Modellierung, language: English, abstract: In the field of multi-modal machine learning, where the fusion of various sensory inputs shapes learning paradigms, this paper provides an introduction to BERT-based pre-trained visio-linguistic models by specifically summarizing and analyzing two approaches: ViLBERT and VL-BERT, aiming to highlight and discuss their distinctive characteristics. The paper is structured into five chapters as follows. Chapter 2 lays the fundamental principles by introducing the characteristics of the Transformer encoder and BERT. Chapter 3 presents the selected visual-linguistic models, ViLBERT and VL-BERT. The objective of chapter 4 is to summarize and discuss both models. The paper concludes with an outlook in chapter 5. Transfer learning is a powerful technique in the field of deep learning. At first, a model is pre-trained on a specific task. Then fine-tuning is performed by taking the trained network as the basis of a new purpose-specific model to apply it on a separate task. In this way, transfer learning helps to reduce the need to develop new models for new tasks from scratch and hence saves time for training and verification. Nowadays, there are different such pre-trained models in computer vision, natural language processing (NLP) and recently for visio-linguistic tasks. The pre-trained models presented later in this paper are both based on and use BERT. BERT, which stands for Bidirectional Encoder Representations from Transformers, is a popular training technique for NLP, which is based on the architecture of a Transformer.

Mediengruppe Stein

Buchhandlungen

Öffentliche Hand

Education

Bibliotheken

RWS

Corporate

Medical

Service

Support-Hotline

Sicherheitsgarantie

Mehrfachnutzung

Mobile Endgeräte

Download-Voucher

Aus- und Weiterbildung

Shop

Alle Preise verstehen sich inklusive der gesetzlichen MwSt.

Download-Voucher einlösen

Kopierschutz

= Kopierschutzfrei

= Wasserzeichen

= DRM Kopierschutz

Installieren Sie Adobe Digital Editions

© 2018-2024 Quolibris GmbH
AGB
Datenschutz
Impressum
Kontakt
F.A.Q
Widerruf