YouTube transcript:
W2 7 Pretraining an LLM

Summary

Core Theme

Pre-training large language models (LLMs) from scratch is prohibitively expensive and resource-intensive for most applications, making it more practical to fine-tune existing, pre-trained models.

Many of the LLMs we've been using have been previously trained, or we say pre-trained, by some company, often by a big tech company. When should you pre-train your own model? This turns out to be so expensive that, when in doubt, I would say probably don't do it. But let's take a deeper look.

Many teams have been pre-training general-purpose LLMs by learning from text on the internet. These efforts to train very large language models may cost tens of millions of dollars, need a large dedicated engineering team, take many months, and require a huge amount of data. Many teams have been open sourcing such models, and that's been a fantastic contribution to the AI community. If you have the resources to pre-train models and maybe even open source them, please, by all means, make that contribution to AI. I think that could be fantastic.

But for building a specific application, given the time and expense of pre-training a model from scratch, I think of this as often an option of last resort. It could help if you have a highly specialized domain and a lot of data. For example, Bloomberg is a company that offers software as well as media articles centered around financial services. Because of its access to a huge amount of text on finance, it trained BloombergGPT, which is Bloomberg's custom-built large language model, purpose-built for financial applications. And Bloomberg reported that, compared to general-purpose LLMs that had learned mainly from internet data, this model does quite a bit better on processing financial text.

For many practical applications, unless you have a huge amount of resources and a huge amount of data, it may be more practical to start with an LLM that someone else had pre-trained, say a general-purpose LLM that's learned from a lot of internet data and that someone has open sourced, and then to fine-tune that to your own data. That will often give pretty decent performance, but in a much more economical way.

Now, I am sincerely very grateful to the teams that have been putting a lot of resources into pre-training LLMs on a lot of text data on the internet and then open sourcing them. In fact, this gives us many different LLMs that we could choose from to use. In the next video, we'll actually take a look at the issue of what size LLM do you want to use, and, of all the different LLMs out there, how do you think about choosing among different ones? Let's go take a look at that in the
