Following Links With Scrapy (regex is GOOD here)

John Watson Rooney

Подписаться 86 тыс.

Просмотров 4,4 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

30 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 38

@bakasenpaidesu 2 года назад

Bro anyway to scrap google search result links??? There is a great google search module available only problem is that it doesn't support proxy... Can u make a video....

@Kyosika 2 года назад

Great vid once again! :)

@JohnWatsonRooney 2 года назад

Thanks Kyo!

@nothingcamefromnothing23ye53 2 года назад

I'm always following your channel... Always waiting for Your videos... ♥️🌀

@JohnWatsonRooney 2 года назад

Thanks I appreciate it!

@tamsssss6765 2 года назад

idk what im doing wrong here "SyntaxError: 'yield' outside function"

@divyamkumarsingh1953 2 года назад

Nice video 👍 can you make a video on how to scrape websites that use cloudflare. Almost all crypto websites use it, for example opensea etc.

@JohnWatsonRooney 2 года назад

I tend to stay away from sites that use bot protection like cloudflare. It can be passed but often isn't worth it due to the expense and time needed.

@scottzimmer8336 2 года назад

John, Your videos are quite interesting and you do a great job explaining as you go. I did some searching around but did not see if you have already tackled vrbo or a site which is setup similarly. I have had rather poor results overall (but of course I am a bit of a newbie). I am curious to compare rates for a given location.

@karthikb.s.k.4486 2 года назад

Nice tutorail as always. What is the theme of VS code used. Please let me know

@JohnWatsonRooney 2 года назад

It’s catpuccin

@uscjake868 2 года назад

Thanks John! I would love to see a video where you identify reasons why automated clicks from various browser automation tools (playwright, selenium) are not clicking something on the page. I have run into jquery/ajax/bot blocking issues where it just won't click it. Love the videos!

@nothingcamefromnothing23ye53 2 года назад

Can you do a video about scrapy inline request method ? , I didn't find a video about using scrapy inline request , it will be very helpful to everyone...

@vivekpatel009 2 года назад

Hii. i thought it would be nice. If you make video on how to test a big scrapy project without running the whole script. I tend to replace return with yield. So control my script. But i am guessing, you might have other ways to do it. Thank you

@sahilasif40 2 года назад

Hey John! I have a bit of a nooby question, how do we use multiple regular expressions in the Rule?

@mehedihossain2573 2 года назад

Best....very helpful

@JohnWatsonRooney 2 года назад

Thank you

@nachoeigu 2 года назад

Great video, I would like to ask you pne question. How to know the amount of request you can make for second to a server in order to avoid break it? I mean, I have to scrape 30k pages and if I use time.sleep(2) is so slow! But if I make all in once, I am scared to break the server

@JohnWatsonRooney 2 года назад

The main issue you’ll find it that your ip will get blocked usually if you try to scrape too fast. It’s very unlikely you’ll take down a server!

@nomirrors3552 2 года назад

Does no one use Beautiful Soup anymore?

@JohnWatsonRooney 2 года назад

I still do, just this video was about Scrapy

@nomirrors3552 2 года назад

@@JohnWatsonRooney Okay, now I understand. I was watching this and thinking it's a few lines in BS... Thanks for the content!

@Sree-en2qv 2 года назад

Hello Sir, I am from India. You're videos are really awesome. It helped me a lot. Thank you so much from my heart. ❤

@JohnWatsonRooney 2 года назад

Thank you that’s very kind

@GelsYT 2 года назад

Hi! THANK YOU! VERY COOL TUTORIAL AND YOU'VE DONE IT IN JUST less than 10 minutes. THANKS! Just a question, does the Rule objects must be in a variable named "rules"? and the order of those Rule classes but be accordingly right? Because if the 2nd Rule object which contains the callback is the first element of the rules list. It wouldn't work as expected right? THANKS!

@hamzaazzouz8056 2 года назад

great, I needed this as I use scrapy, the crawl part is a bit scary , u made it look easy.

@JohnWatsonRooney 2 года назад

Keep working at it and build it up bit by bit you’ll get there

@alfakih7247 Год назад

How did you do that

@tnssajivasudevan1601 2 года назад

Great Video Sir Really informative

@JohnWatsonRooney 2 года назад

Thank you!

@atakaragyozov 2 года назад

As always great tutorial, thanks!

@JohnWatsonRooney 2 года назад

Thanks very kind

@stephenwilson0386 2 года назад

Seems like you're on a different OS in every video. Looks like we have standard Ubuntu here instead of WSL? Jokes aside, another super helpful video! Every time I get stuck it seems you've already made a video to address the issue. Many thanks for your channel!

@JohnWatsonRooney 2 года назад

Hah my older videos are WSL, but I made the switch to Linux around a year ago, so going forward all will be Ubuntu/whatever I move too to get away from snap when I’m ready

@stephenwilson0386 2 года назад

@@JohnWatsonRooney Nice! I think we had this conversation in one of your live streams. I did a bunch of distro hopping but finally landed on openSUSE Tumbleweed with the Gnome desktop - you get a rolling release that still has very good stability, and great software compatibility since it's derived from an enterprise OS. Anyways happy scraping, thanks again for the content!

@JohnWatsonRooney 2 года назад

@@stephenwilson0386 Yes I remember. I've also done my fair share of distrohopping but before it was always more of just curiosity - I learned a lot... Main thing was how much I love i3wm and tiling!

@stephenwilson0386 2 года назад

@@JohnWatsonRooney I tried tiling for awhile and mostly enjoyed it (i3 and bspwm), but I'd always end up missing some of the creature comforts of a full desktop. I've thought about going back maybe as a secondary install though. It's one of the great things about Linux, there's no "one-size-fits-all" forced on you like Windows or Mac.

@ervankurniawan41 2 года назад

Do you have spesific course that explain Scrapy in advanced/intermediate level? Sounds good if you create one on learning platform like Udemy, bcz i like your way to guide us through this channel. 🥲