Тёмный

08 Working with Strings, Dates and Null 

Ease With Data
Подписаться 3,9 тыс.
Просмотров 2,2 тыс.
50% 1

Video explains - How to use Case When in Spark ? How to manipulate String data in Spark DataFrames? How to cast dates in Spark ? How to extract date portions in Spark ? How to work with NULL data in Spark ?
Chapters
00:00 - Introduction
01:08 - How to use Case When in Spark?
04:30 - String Regex Replace
06:00 - How to convert string to date in Spark?
08:10 - How to add current date or timestamp in Spark ?
10:07 - How to drop NULL records in Spark ?
10:50 - How to transform NULL Columns in Spark ?
12:18 - Fix DataFrame
14:00 - Bonus Tip
Local PySpark Jupyter Lab setup - • 03 Data Lakehouse | Da...
Python Basics - www.learnpython.org/
GitHub URL for code - github.com/subhamkharwal/pysp...
Documentation Spark Functions - spark.apache.org/docs/latest/...
Documentation Date/Timestamp Patterns - spark.apache.org/docs/latest/...
The series provides a step-by-step guide to learning PySpark, a popular open-source distributed computing framework that is used for big data processing.
New video in every 3 days ❤️
#spark #pyspark #python #dataengineering

Опубликовано:

 

15 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 17   
@vijayvavilapalli1002
@vijayvavilapalli1002 Год назад
Wonderful.. I ever seen these kind of teaching.. thankyou bro!! Please add more videos.
@easewithdata
@easewithdata 9 месяцев назад
Sure, I am working on it now.
@anonymous-ze5fg
@anonymous-ze5fg 11 месяцев назад
great content, Please keep adding more videos, very helpful.
@easewithdata
@easewithdata 9 месяцев назад
Thanks, will do!
@marimuthukalyanasundram3151
@marimuthukalyanasundram3151 2 месяца назад
You're a very awesome guy. Your explanation is straightforward to understand. I have a few clarifications. Why do we have to import the libraries for each function? Is there an option to import the main libraries and achieve the same? For example, for the date conversion, you import date_format and the_date. I believe we can use Import *
@easewithdata
@easewithdata 2 месяца назад
Hello, Thank you. Please share this with your network over LinkedIn ❤️ And for the second part, yes you can import as per your choice. Only importing required functions make it more neat and optimized.
@marimuthukalyanasundram3151
@marimuthukalyanasundram3151 2 месяца назад
@easewithdata, definitely I will do that. Keep following this energetic training. You have a very bright future in the IT world.
@irannamented9296
@irannamented9296 6 дней назад
need to understand one thing why yyyy and dd not in capital letter is there any reason for that
@easewithdata
@easewithdata 5 дней назад
Spark follows the following datetime pattern format (mostly resembles to Unix formats) spark.apache.org/docs/latest/sql-ref-datetime-pattern.html
@passions9730
@passions9730 Год назад
Good content
@easewithdata
@easewithdata Год назад
Thanks 👍 Please make sure to share with your network 🛜
@aryans4519
@aryans4519 2 месяца назад
Can we use na.fill to fill missing values, instead of coalesce?
@easewithdata
@easewithdata 2 месяца назад
coalesce is used for condition handling for nulls. na.fill will do the genaric fill for the columns.
@aryans4519
@aryans4519 2 месяца назад
Thanks, this cleared my doubt 😀
@pranavganesh1855
@pranavganesh1855 6 месяцев назад
Bro, what is the purpose of using coalesce here??
@easewithdata
@easewithdata 6 месяцев назад
It is being used to transform null values. It works sane as nvl in sql. We even have coalesce in SQL. I know you might be confusing it with partitioning coalesce. But currently its a column transformation to fix null values. Partitioning one is applied on table level.
@pranavganesh1855
@pranavganesh1855 6 месяцев назад
@@easewithdata Thank you..
Далее
09 Sorting data, Union and Aggregation in Spark
10:10
Просмотров 1,7 тыс.
Kafka Tutorial - Core Concepts
13:04
Просмотров 918 тыс.
ЛУЧШАЯ ПОКУПКА ЗА 180 000 РУБЛЕЙ
28:28
How Ai Is About To Transform The World’s Economy
19:19
14 Read, Parse or Flatten JSON data
17:50
Просмотров 1,8 тыс.
Build an SQL Agent with Llama 3 | Langchain | Ollama
20:28
24 Fix Skewness and Spillage with Salting in Spark
21:17
The ONLY PySpark Tutorial You Will Ever Need.
17:21
Просмотров 123 тыс.
26 Spark SQL, Hints, Spark Catalog and Metastore
19:20
Просмотров 1,3 тыс.
Simulating the Evolution of Rock, Paper, Scissors
15:00
ЛУЧШАЯ ПОКУПКА ЗА 180 000 РУБЛЕЙ
28:28