Seattle Data Guy

251
4 960 183

Welcome to my channel.

I make videos on data science, data engineering and consulting.

Follow me on:

LinkedIn: www.linkedin.com/company/18129251/
Facebook: facebook.com/SeattleDataGuy
Twitter: twitter.com/SeattleDataGuy

Going From Data Engineer To Head Of Data - How To Run A Data Team Successfully

7:44

14 days ago

Apache Spark Vs Apache Flink - Looking Through How Different Companies Approach Spark And Flink

26:11

1 month ago

Intro To Databricks SQL AI Functions - 5 SQL AI Functions Databricks Has And How To Use Them

35:07

1 month ago

If I could give advice to myself when starting as a data engineer

11:14

1 month ago

Data Modeling Where Theory Meets Reality - How Different Companies I Worked At Modeled Their Data

18:13

2 months ago

How To Escape The Rat Race - 6 Tips I Wish I Had Before I Became An Independent Consultant

12:32

2 months ago

What Is S3 And How Can You Query It With AWS Athena - AWS Data Engineering 101

15:56

3 months ago

What Tools Should Data Engineers Know In 2024 - 100 Days Of Data Engineering

17:31

3 months ago

Using AWS Lambda As A Data Engineering - Automating An API Extract With AWS Lambda And Eventbridge

18:01

3 months ago

Best AWS Services You Need To Know As A Data Engineer - How To Become A Data Engineer

11:22

4 months ago

Optimizing Your Data Infrastructure - How To Become A Better Data Engineer

12:45

4 months ago

Data Modeling - Walking Through How To Data Model As A Data Engineer - Dimensional Modeling 101

20:43

4 months ago

How And Why Data Engineers Need To Care About Data Quality Now - And How To Implement It

16:06

5 months ago

Fastest way to Start Your Data Engineer Journey in 2024 - 100 Days Of Data Engineering Crash Course

15:57

5 months ago

The Ultimate Guide To Starting An Independent Consulting Company In 2024 | Data Consulting 101

25:47

6 months ago

Data Modeling - Why Data Engineers Need To Understand It - An Introduction To Data Engineering

17:38

7 months ago

What Is Apache Druid And Why Do Companies Like Netflix And Reddit Use It?

11:41

8 months ago

The Realities Of Airflow - The Mistakes New Data Engineers Make Using Apache Airflow

12:26

8 months ago

Data Architects Vs Data Engineers - Is There A Difference?

11:04

9 months ago

What Is Docker - Docker Intro And Tutorial On Setting Up Airflow | High Paying Data Engineer Skills

22:25

9 months ago

How To Fast Track Your Data Engineering Career - Translating Business Requirements Into Value

12:08

10 months ago

Everyone's Data Infrastructure Is A Mess - The Truth About Working As A Data Engineer

12:54

11 months ago

Data Modeling Challenges - The Issues Data Engineers & Architects Face When Implementing Data Models

12:51

11 months ago

Why I Left Data Science - And Picked Data Engineering Instead

7:14

1 year ago

What Is Change Data Capture - Understanding Data Engineering 101

7:27

1 year ago

How I'd Become A Data Engineer (If I had to start over as a data analyst in 2023)

13:23

1 year ago

A Decade In Data Engineering - Has Anything Actually Changed?

14:27

1 year ago

Data Engineering Vs Machine Learning Pipelines - What Is The Difference

12:01

1 year ago

Will Data Engineering Exist In 5 Years - Is Data Engineering A Good Career Choice?

9:32

1 year ago

Comments

@i-see-right-through-you 1 day ago

I've worked with Palantir. It looks glitzy and easy and click click click, but it's the type of thing that if you have to do anything that's out of the ordinary you have to bend their rigid "objects" into new shapes that they don't like. I'm pretty sure they bought all the "Apps" from different vendors because they all seem to have different paradigms - not a cohesive development environment at all. And the documentation sucks. Definitely mostly sad path here. Notice how often he has to click around A LOT just to do something really really simple? And he doubts if he's done something right (or wrong) because even after doing the little thing he's demoing, it's just confusing and a mess. There is actually a code interface, but I remember assuming that I could loop a directory and act upon every object in the folder, but dig this, after spending a bunch of time working towards a solution with that assumption in mind, it turns out, no, you can't loop the objects in a folder. WTF? What doesn't allow you to do that through their API or any other way? "Hopefully this doesn't break." Lol love that quote. Exactly how I felt working with this piece of trash every day.

@AdamTemple8888 4 days ago

Cool channel! I noticed that Snowflake's stock has been pretty bombed out since IPO. Are they still as integral as they were when they went public?

@RamzyTheDad 5 days ago

Appreciate this video, something I’m considering myself. Good to hear from a seasoned pro like you!

@SeattleDataGuy 5 days ago

Glad it was helpful! let me know if you have any other questions!

@Anonymous_moron 5 days ago

I’m onboarding to Meta literally today, and this video tied together most of the tools for me! 😊

@SeattleDataGuy 5 days ago

glad to hear it! also it sounds like a lot of the tools are the same as they were when I was there!

@farrukhnasirsiddiqui7471 5 days ago

Hi, How to start? What are the platforms where I can get clients?

@TushaarDwivedi 5 days ago

The voice is not clear. Better work on it. It spoils the whole experience of listening.

@SeattleDataGuy 5 days ago

any thoughts on mics?

@RamzyTheDad 5 days ago

@SeattleDataGuy look into compression. Sometimes a good compressor setting for the mic audio helps make the voice pop

@TushaarDwivedi 4 days ago

@SeattleDataGuy I'm not sure about the mic. It could be settings. Also, I request that you upload the videos in full HD at least.

@jayeshmhaske8839 4 days ago

@SeattleDataGuy voice was fine to me

@lucasthomspon4775 5 days ago

Very good content, as always!!!!

@SeattleDataGuy 5 days ago

thank you!

@karszn 5 days ago

Hey Ben. Thanks for this

@SeattleDataGuy 5 days ago

you're welcome!

@madukomablessed4712 6 days ago

Hi Ben, awesome roadmap, even though you are yet to update some parts such as the article you are yet to upload and the live Q and A😂

@raphaeldayan 6 days ago

awesome video, thank you!

@SeattleDataGuy 5 days ago

Glad you liked it!

@hrsh3329 6 days ago

🎉

@SeattleDataGuy 5 days ago

thank you!

@raphaeldayan 7 days ago

Thanks for the video!

@SeattleDataGuy 7 days ago

you're welcome!

@nylarch4227 7 days ago

dbt is a game changer - it can help create a cultural change where people actually like working with data teams because they see lineage, quality and quick turnarounds built into the system. Highly recommended!!!

@VnoteAI 8 days ago

Thanks for the call out Christine Carragee! Fun to hear about people creating something clear when your brain would otherwise be shuffling ideas from a "50 disc" CD player of thoughts.

@SeattleDataGuy 7 days ago

She was a great guest, I'll have to have her on again!

@VnoteAI 7 days ago

@SeattleDataGuy 100%

@josephpark3949 9 days ago

At like 22:30 you talk about finding a wave (like LLMs) and partnering with a vendor. Can you please let me know how I do that?

@SeattleDataGuy 7 days ago

This would be a great question to pose to the consulting community I am growing, feel free to join and ask - the-technical-freelancer-academy.circle.so/getting-started

@josephpark3949 7 days ago

@SeattleDataGuy Thanks! I will

@dwm0916 9 days ago

Christine, the word ‘lumpy’ is a wonderful descriptor for consulting income. It makes me think of my dad describing cycling in slightly hilly terrain - lumpy.

@SeattleDataGuy 7 days ago

It is! I think of it kind of like riding waves. Some days have great waves, others small ones, and still others massive ones. You just gotta respect the ocean

@alexischicoine2072 10 days ago

The price of the model for ai summarize is a lot less than llama 3 though. Also good to note the price for enterprise and premium seems to be the same which doesn’t seem to be the case in Snowflake.

@quangvinhle7912 11 days ago

Will data engineering be OK in the future? I want to transition from a backend developer to a data engineer.

@SeattleDataGuy 7 days ago

I think it'll change, I like what Joe Reis said recently in his blog where he talked about DEs having to be more aware of data science and software so we can provide data for AI, data science, analytics, etc

@tomastruchly9484 11 days ago

I love the "last mile" issue presentation when it comes to purely AI generated content. Therefore, as mentioned, only very few such solutions are deployed to Production. When it comes to IT consultants, they are usually brought in for the short term (up to 1 year) when the core team lacks the needed expertise or experience. The demands are high as they tend to be very expensive. The issue is they come, do the work & leave. So it may become really difficult to do certain adjustments to the solution afterwards. Doing fine documentation during the development can help, however not many companies are willing to pay the consultant's high rate to do it.

@SeattleDataGuy 7 days ago

Yeah, the last mile was a great way of summarizing the issue and perhaps a great place for AI to play a role in trying to help bridge the gap

@lukemurphy3362 12 days ago

What about using a surrogate key?

@harshmishra3534 13 days ago

Hi the course *Big Data on Amazon web services (AWS)* isn't available anymore, Any alternative for that one? Thank you.

@ollienicholson 14 days ago

Hey youtube.com/@SeattleDataGuy, love your videos so far! Was curious if you'd like to add your insight into the following terms? Batch processing v stream processing, and OLTP v OLAP

@SeattleDataGuy 7 days ago

those would be some great topics, I have written on the OLTP vs OLAP topic before here - seattledataguy.substack.com/p/oltp-vs-olap-transactions-vs-analytics

@RedCodra_ 15 days ago

Love hearing independent consulting best practices and what to expect! How do you approach accessing these clients' data and systems as a third party? Do they typically just give you a license (in the case of, say, Microsoft 365) as they would a W2 employee?

@SeattleDataGuy 14 days ago

Sometimes, usually more a "seat" vs a license. I haven't often needed Microsoft products that are on a laptop, and if I have, the client has sent me a physical laptop.

@FirstNameLastName-fv4eu 15 days ago

This guy is the best example when you spend 10 yrs of your professional life in "super-cheap-money-world" what happens, a smart kid with a very vague idea of the real world :)

@SeattleDataGuy 15 days ago

You think I am smart shucks. What is the real world to you?

@FirstNameLastName-fv4eu 15 days ago

@SeattleDataGuy explaining the same reason to a Bank where people don't evaluate a technology on "how much money" it has raised. Your generation is just spoiled or scammed by cheap money culture.

@SeattleDataGuy 14 days ago

Who do you think is responsible for cheap money culture?

@DataPains 16 days ago

Used it for years, I also tried the later 2.x version, I still don't like it, and I think there are better ways of architecting pipelines. But yeah I was amazed when I saw Airflow the first time, and it did solve a lot of problems, but I still think, it is a tool of the past. I hope I am wrong!

@SeattleDataGuy 14 days ago

It's been a decade, so I wouldn't be surprised to see it replaced in the next 5 years. But never know, some things are hard to get rid of.

@DanielKamau-ku5cs 17 days ago

Not clearly explained, just bs .

@Kira-ji5pr 18 days ago

I’m thinking of switching from full stack to data engineering . Any advice ??

@SeattleDataGuy 14 days ago

Is there a reason you want to switch?

@Sid-st5dn 3 days ago

@SeattleDataGuy money ??

@William-B 18 days ago

We’re a young data team for a large organization. Biggest roadblocks for us are issues with data governance (“you can’t have or report on our data”), budget for tooling (“prove the value of the tool, then we can purchase it”), and cloud concerns (“all my data is on-prem. You can’t just put it in the cloud”)

@SeattleDataGuy 14 days ago

Yeah, those are always a struggle. In some companies you'll never win that battle (until leadership gets changed out); in other cases you have to be willing to speak your mind and say, "Hey I can't do XYZ which you asked me to do under these conditions, so either things stay the same or you start opening doors". But that's easier to say as a consultant because I don't mind ending a project if a client won't work with me to get to the goal they wanted to get to (never had to go that far).

@smrtysam 19 days ago

This has happened to me. Now I’m leading a team of data scientists, engineers, analysts and migration specialists. I’ve had to learn so much so quickly about strategy and people management. I’ve had to coach the people on my team to really own their own tasks. At the beginning of being head of data I was taking on way too many “low level tasks”. Now I’m delegating and empowering. I still have a lot to learn though.

@SeattleDataGuy 14 days ago

This is an awesome story of growth. Any tips for future heads of data?

@crisithink9509 19 days ago

I wonder how much Data God has in the Aether/Astral realm 🤔

@SeattleDataGuy 14 days ago

asking the real questions here!

@SeattleDataGuy 19 days ago

If you're looking for help setting up your data team and strategy, then feel free to set-up a free consultation here - calendly.com/ben-rogojan/consultation

@Ian-vh2vv 19 days ago

Just went thru this process with my company the past year. Great video. With us it went something like:
- Where is all of our data
- How are we doing reporting now
- What are the shortcomings of existing reporting solutions
- Do we need a warehouse (yes)
- What warehouse do we pick
- What ETL stack makes sense for our use case
- What do we integrate in what order to maximize value and get adoption rolling

Also, having someone on the exec level champion the BI effort and really push it forward was huge for the thing to actually materialize.

@SeattleDataGuy 19 days ago

Thanks for sharing! I really appreciate it when people add more context and their own experiences. Were there any gotchas you ran into while going through this process?

@baw5xc333 18 days ago

How long did this rollout take?

@Ian-vh2vv 18 days ago

@baw5xc333 about 6 months from step 1 until I started development (first snowflake table and started integrating our first source system)

@sirus312 19 days ago

I keep hearing from top CEOs that with Palantir we don't need teams anymore

@SeattleDataGuy 19 days ago

I'd love to believe this! I guess the reason I have a hard time believing it is because I know there are lots of consultants that work in the space of setting up Palantir, which suggests that it still requires technical skills to set up and work with (also based on a few conversations I have had with people working with Palantir). But always happy to be wrong.

@hakeem1340 19 days ago

Thank you for sharing

@SeattleDataGuy 19 days ago

Thank you for watching!

@hantt 19 days ago

the de role should not exist, it should just be sde who also own data as a product. kind of like front end, backend, there will be a data focused engineer, that we can call data engineer. oh wait

@crypt_hodl 21 days ago

Interested! can you please have special pricing for people in Africa. 50% reduction is good but our earnings are way too low probably 20x less than those in US or Europe. It becomes difficult for us to participate in this type of good courses. Any help! Thanks.

@SeattleDataGuy 14 days ago

Sure, happy to give a coupon, here is one for 80% off, once there are none left there are none left - lifetime_80

@crypt_hodl 13 days ago

@SeattleDataGuy Thank you so much, I really do appreciate it. I am in now.

@data-dynamo-guy 21 days ago

I also find myself building stuff rather than analyzing business problems @@

@SeattleDataGuy 21 days ago

It's always interesting how we all come to the same conclusion, thanks for watching!

@Aristocle 22 days ago

Is there a service or scripting language that allows me to write relationships between tables/databases in a modern material design style?

@serk-s 23 days ago

Man, you really need to stop pitching your voice higher at the end of your sentences :(

@SeattleDataGuy 21 days ago

fair enough, on the flip side i have picked up a vocal fry trying to do that lol

@richardmartin6605 23 days ago

Would love to see article reviews!

@SeattleDataGuy 21 days ago

awesome! any particular articles?

@initialb811 24 days ago

This is really awesome. Would love to see more of this!

@SeattleDataGuy 21 days ago

this is one of my all time faves

@TJInTech10 24 days ago

thx for breaking it down

@SeattleDataGuy 21 days ago

glad you found it helpful!

@TJInTech10 21 days ago

@SeattleDataGuy yes, thx, I'm trying to understand how knowledge graphs/vector DBs will integrate into this too. Is it safe to assume both will be essential pieces of the enterprise AI layer/stack now being invested in heavily, or do you see one being more relevant in the next 2-5 yrs?

@SentinelaKosmos 28 days ago

Don’t just be a task taker, be a strategic player.

@SeattleDataGuy 21 days ago

thanks for reading my articles and watching my videos!

@B-gaming930-fl5qr 29 days ago

E5 is where it's at 750 Million 😂

@osoucy 29 days ago

To me, one of the main benefits of Spark Structured Streaming is that you can easily switch between near real-time (micro batches) and scheduled batch processing without having to rewrite a single line of code. This is a very effective way of scaling up and down and balancing costs vs latency.

@SeattleDataGuy 14 days ago

that is very useful! when do you think micro-batches make the most sense?

@cestlachance7575 1 month ago

Is this really a good video? I feel like he just namedrops every tech

@moussaelaqqaoui 1 month ago

Hello Ben, can we have a discussion please!