Hi, I have not gotten good words to appreciate your work. Unnecessarily I have brought a course in Udemy without seeing this video. Really appreciate your hard work. Thank you so much and will follow other videos to learn snowflake.
Hi, for Question 7 I think the answer should be "Suspending the Warehouse" and "Passing of 24 hrs". Let's say I spin up a warehouse run some queries and then for next 48 hrs do not use the warehouse (and also do not suspend the warehouse). In such case the cache would be purged. I think this question is confusing or doesn't accounts for 24hr window.
I am glad it was helpful! Do let me know if there are any feedback or any other content are requirment which helps individual and community for snowflake education and snowflake certification.
Thank you Tushar for watching my video and sharing your feedback. your word of appreciation really means a lot to me. I have already published other knowledge series and snowflake certification videos and if you are interested, you can refer them. And please share your contructive feedback so I can improvise my content and help snowflake and data engineering community. Complete Guide + Question Dump + Revised Syllabus -------------------------------------------------- 👉 SnowPro Guide -> bit.ly/35S7Rcb 👉 Revised Jan 2021 Syllabus -> bit.ly/3tqOkJb 👉 SnowPro Practice Test Playlist of 300 questions -> bit.ly/2ZLQm9E You can also follow Data Engineering Channel -------------------------------------------------- 👉 Twitter -> twitter.com/de_simplified 👉 Facebook -> fb.me/dataengineeringsimplified 👉 GitHub -> github.com/TopperTips 👉 Website -> toppertips.com
In case of multi-cluster warehouse environment by adding new cluster or removing the additional cluster (resizing of a cluster) will result in releasing the warehouse cache ? or only in case of scale up and down case release the warehouse cache ?
Hi Hitendra, thanks for reaching me out again. Warehouse Cache behaviour is not different for multi cluster VWH. In really and under the hood, snowflake takes time to decommission those virtual instances when they are not in use and if scale up happens again, it associate it again and if the data is still available in that virtual machine. The only way to simulate it by looking into the query profiing or query history where you can see if scale up as really used the warehouse cache. Snowflake does not really talk openly about its implementation and they just explain at very conceptual level.
I have end to end video on etl and it can answer your question. you can also watch this end to end snowpark video ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-1jC98XQwBZw.html would suggest to download the summary sheet and find your answer 🚀🚀 Download Summary Sheet ➥ Complete Snowflake Tutorial rebrand.ly/maj4l6x ➥ Complete Snowflake JSON Guide rebrand.ly/d52cef
Can you suggest an answer for - Which operation does not require a Virtual warehouse Select count(*) from Table1 Select Max(col1) from Table 1 Select Avg(col1) from Table 1 select Sum(col2) from Table 1 Select Col1 + Col2 from Table 1
Thank you, Ashish, for watching my video and sharing your query. Only the Select count(*) from Table1 will not use the WH and rest all queries will WH. Just to make sure that I am not giving you incorrect input, here you can run queries and see the query history table to check the wh size. select count(*) from "SNOWFLAKE_SAMPLE_DATA"."TPCH_SF1"."CUSTOMER"; -- no use of WH select max(c_acctbal) from "SNOWFLAKE_SAMPLE_DATA"."TPCH_SF1"."CUSTOMER"; -- used the WH (see the query history tab) select avg(c_acctbal) from "SNOWFLAKE_SAMPLE_DATA"."TPCH_SF1"."CUSTOMER"; -- used the WH (see the query history tab) select sum(c_acctbal) from "SNOWFLAKE_SAMPLE_DATA"."TPCH_SF1"."CUSTOMER"; -- used the WH (see the query history tab) select c_acctbal+c_acctbal from "SNOWFLAKE_SAMPLE_DATA"."TPCH_SF1"."CUSTOMER" limit 10; -- used the WH (see the query history tab) I have already published other knowledge series and snowflake certification videos and if you are interested, you can refer them. And please share your contructive feedback so I can improvise my content and help snowflake and data engineering community. Complete Guide + Question Dump + Revised Syllabus -------------------------------------------------- 👉 SnowPro Guide -> bit.ly/35S7Rcb 👉 Revised Jan 2021 Syllabus -> bit.ly/3tqOkJb 👉 SnowPro Practice Test Playlist of 300 questions -> bit.ly/2ZLQm9E You can also follow Data Engineering Channel -------------------------------------------------- 👉 Twitter -> twitter.com/de_simplified 👉 Facebook -> fb.me/dataengineeringsimplified 👉 GitHub -> github.com/TopperTips 👉 Website -> toppertips.com
@@DataEngineering I did it multiple times and i saw that any sggregate function like count , max , min , avg doesn't use any VH, instead logically this must take metadata cache only. Please correct me if i am wrong.
@@vvkwadhwa you could be right and I need to check this once again. As per snowflake documentation what you says is right and I also tried couple of times but my result was different.
@@DataEngineering @Vivek Wadhwa What I noticed is when you do a min only on a date column, the metadata cache is used. You do it on a number column, e.g.: min(employee_id) from employees table and it would not use the metadata cache. Also, one query to @Data Engineering Simplified - When is the warehouse cache used? In what scenario. Also, want to thank you for a very thorough explanation of each and every concept.
@@DataEngineering Sorry for replying on 1 year back thread, I observed that when we use shared db tables for min,max,avg virtual warehouse is used and when we use table from the tables we created in db ,it does not use virtual warehouse.
Doest Metadata Cache stores the aggregated data? I found the below question with answer in one of the website... and it confused me about the concept of Metadata Cache.. Can you please let me know if the below answer is correct or not ? The Snowflake Metadata repository contains which of the following? 1. Table Definitions - True 2. Refrences to all of the mocropartion files of that table -true 3. Tracking the all versions of the table data within the data retention window -true 4. Aggregated Data from the table
Thank you for watching my video and sharing your query. The option 4 is not correct. Metadata captures the min/max but not aggregated data. Thank you for watching my video. I have already published other knowledge series and snowflake certification videos and if you are interested, you can refer them. And please share your contructive feedback so I can improvise my content and help snowflake and data engineering community. Complete Guide + Question Dump + Revised Syllabus -------------------------------------------------- 👉 SnowPro Guide -> bit.ly/35S7Rcb 👉 Revised Jan 2021 Syllabus -> bit.ly/3tqOkJb 👉 SnowPro Practice Test Playlist of 300 questions -> bit.ly/2ZLQm9E You can also follow Data Engineering Channel -------------------------------------------------- 👉 Twitter -> twitter.com/de_simplified 👉 Facebook -> fb.me/dataengineeringsimplified 👉 GitHub -> github.com/TopperTips 👉 Website -> toppertips.com
Hi, Q14, warehouse state has no effect on resulset cache since it's store at the Cloud Service Layer, not the Virtual Warehouse layer.....so the answer is FALSE
The question is bit confusing, the result cache is stored in cloud layer but virtual warehouse also has a cache and when the VWH is suspended, the result stored in VWH is aso cleared. In place of result, it should have been data.
Can we use result cache without having active WH?. In Snowflake WEBUI I have checked after running same query which I execute with in 24 hrs , WH state remained suspended but some where I read Warehouse needs to be Active..It would be great help if you can clear this doubt..
The purpose of cache it to reuse the result available in cloud service layer unless otherwise query is different or underlying data is changed. The cloud service layer (watch ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-IocdgUB94KQ.html if you hvae not seen the 3 layer architecture) knows when to use cache and when not.. and if not then it will ask for active warehouse.
yes, and you have potential only think to keep in mind keep videos short and modular make from beginner to advanced level keep good examples for shure your content on udemy will grow super data science, allan Rodriguez , learning journal, pirerian data , sundog are some of the successful earners who are earning millions of dollars from udemy.
@@DataEngineering Only B is valid. The notion of 59 minutes does not exists and you do not have to have the same role, the roles have simply to have sufficient access to the root tables/views