If you want to support me, the best thing to do is to share the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: www.subscribestar.com/yannickilcher
Patreon: www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
This is already sort of a 'fail', in that the important thing science does is not the relations: the symbolic equation is just icing. The important part is the terms themselves, the properties and operators that you think could describe a system. So even if this is successful, it's basically useless; 99% of the 'work' is already done in deciding that we care about this thing called 'mass', that things have 'mass', etc.
Regarding the explanation at 27:57: they have written that if the likelihood p(y|x) is low, then 1 - p(y|x) will accelerate the gradients. If I am not wrong, this can be read as: if, let's say, the likelihood of the losing side is higher, then the gradient will accelerate towards the other side.
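If what's meant here is a focal-loss-style modulating factor (my guess from the 1 - p(y|x) term; the paper may define it differently), a tiny sketch shows the effect: a low true-class probability blows up the loss and gradient, while confident predictions are damped.

```python
import math

# Hedged sketch (my reading, not the paper's code): (1 - p)^gamma up-weights
# examples where the true-class probability p(y|x) is low, and damps the rest.
def focal_loss(p_true, gamma=2.0):
    return -((1.0 - p_true) ** gamma) * math.log(p_true)

for p in (0.1, 0.5, 0.9):
    print(f"p(y|x) = {p}: loss = {focal_loss(p):.4f}")
# p(y|x) = 0.1: loss = 1.8651   <- "losing side": large gradient pressure
# p(y|x) = 0.5: loss = 0.1733
# p(y|x) = 0.9: loss = 0.0011   <- confident prediction: nearly ignored
```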
I skimmed through the paper and couldn't find the part where they state that the random attention pattern is the same from layer to layer... Are you sure the layers didn't have different patterns for the same batch? Mind pointing to the exact location in the paper where you got this idea? (Sorry for being nit-picky, but this part seems important.)
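To make concrete what I'm asking (my own illustration, not code from the paper), here are the two variants: one random attention pattern reused in every layer versus an independently sampled pattern per layer.

```python
import torch

# Variant A reuses one random sparse attention mask in all layers;
# variant B samples a fresh mask per layer. The question is which one the paper does.
seq_len, n_random, n_layers = 16, 3, 4

def random_mask():
    mask = torch.zeros(seq_len, seq_len, dtype=torch.bool)
    for i in range(seq_len):
        mask[i, torch.randperm(seq_len)[:n_random]] = True  # n_random keys per query
    return mask

shared = random_mask()
masks_a = [shared] * n_layers                       # variant A: same pattern everywhere
masks_b = [random_mask() for _ in range(n_layers)]  # variant B: independent per layer

print(torch.equal(masks_a[0], masks_a[1]))  # True
print(torch.equal(masks_b[0], masks_b[1]))  # almost surely False
```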
Who would guess that honesty, and a willingness to say horrible things and call them out as such, would yield the most truth in a self-censoring A.I.? But I would appreciate it if you kept your little filthy fingers away from the things you label "terrible, horrible" or any adjective of similar magnitude.
I actually tried to implement this recently, not knowing it had already been invented. My idea, though, was to put constraints on Z and then do more training steps to get the latent representation.
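Roughly what I had in mind, as a sketch (all names are mine; `dec` stands for any frozen decoder): skip the encoder and run extra gradient steps on z itself, projecting z back onto a norm ball after each step as the constraint.

```python
import torch

def infer_latent(dec, x, z_dim=32, steps=200, lr=1e-2, max_norm=1.0):
    # Optimize the latent code z directly against a reconstruction loss.
    z = torch.zeros(1, z_dim, requires_grad=True)
    opt = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = torch.nn.functional.mse_loss(dec(z), x)  # reconstruction error only
        loss.backward()
        opt.step()
        with torch.no_grad():
            n = z.norm()
            if n > max_norm:          # the constraint on Z: project back onto the ball
                z.mul_(max_norm / n)
    return z.detach()
```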
I am new to the AI field (studying deep learning), but I am not new to the --isms debates; I studied intellectual history back in undergrad. [For those interested: from the history-of-philosophy perspective, there is the rise of postmodernism, and from the philosophy-of-science perspective, there is Thomas Kuhn to start with.] I look at this tweet exchange and I see two sides arguing from different mindsets, motives, and goals. One side is trying to discover new scientific truths (if you're a scientific realist); the other side is trying to shift power balances. I do think both goals are important. But, in my simplified assessment, this is at least part of the reason why the benefit of public discourse seems to be at an all-time low (some days).
Thanks for the explanation! So the Q-function basically percolates from the near-end-game moves, whose rewards are easier to learn, and gradually works its way from the back to the beginning?
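That intuition is easy to see on a toy problem (my own sketch, not from the video): a 5-state chain where only the last step pays reward 1. After one sweep only the move next to the goal has value; further sweeps percolate it back toward the start.

```python
import numpy as np

n_states, gamma, alpha = 5, 0.9, 0.5
Q = np.zeros(n_states - 1)  # one action ("step right") per non-terminal state

for sweep in range(1, 21):
    for s in range(n_states - 1):
        r = 1.0 if s == n_states - 2 else 0.0               # reward at the end only
        next_v = Q[s + 1] if s + 1 < n_states - 1 else 0.0  # terminal value is 0
        Q[s] += alpha * (r + gamma * next_v - Q[s])         # TD backup
    if sweep in (1, 5, 20):
        print(sweep, np.round(Q, 3))
# sweep 1:  [0.     0.     0.     0.5 ]     only the end-game move has value
# sweep 20: [~0.729 ~0.81  ~0.9   ~1.0]    value has propagated back to the start
```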
This paper should be called "How to implement A* in the most expensive way possible". I would be surprised if a transformer network (which, remember people, is just a bunch of stacked NNs with backprop goodness) couldn't learn it. Transformers should be able, if massaged and trained correctly, to do basically any type of ML curve fitting we've discovered, provided you can shove it into their context window (which is just a big vector that gets shoved through a bunch of NNs). Learning A*, a very short algorithm, plus some data processing, seems very reasonable.
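For reference, here's how short A* actually is: a minimal 4-connected grid version (my own sketch; the paper's exact setup may differ).

```python
import heapq

def astar(grid, start, goal):
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan heuristic
    frontier = [(h(start), 0, start)]  # (f = g + h, g, node)
    best_g = {start: 0}
    came_from = {}
    while frontier:
        f, g, node = heapq.heappop(frontier)
        if node == goal:               # reconstruct the path back to the start
            path = [node]
            while node in came_from:
                node = came_from[node]
                path.append(node)
            return path[::-1]
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nb = (node[0] + dr, node[1] + dc)
            if (0 <= nb[0] < len(grid) and 0 <= nb[1] < len(grid[0])
                    and grid[nb[0]][nb[1]] == 0):
                ng = g + 1
                if ng < best_g.get(nb, float("inf")):
                    best_g[nb] = ng
                    came_from[nb] = node
                    heapq.heappush(frontier, (ng + h(nb), ng, nb))
    return None

grid = [[0, 0, 0],
        [1, 1, 0],
        [0, 0, 0]]  # 1 = wall
print(astar(grid, (0, 0), (2, 0)))
# [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0)]
```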
I am enrolled in a master's course in AI and I have to read a lot of research papers like these as new ones come out, and this channel has the best, simplest paper-explanation videos out there. Also, I completely disregarded all the hints about the authors of the paper; I don't know who wrote it. 🤫
It's almost like Hafner et al. watched your video and built v3 to rectify your criticisms. Transferability to other problems: check. Fewer hyperparameters: check. A more generalizable loss function: check. Would really love to see a video like this going over v3. I've been having a hell of a time wrapping my head around it, but this video is still helping a ton. Thanks Yannic!!