All I know about Stata - Data Management

All I know about Stata - Data Management

100
73 382

Подписаться

I want to share all I know about managing data with Stata. Stata has some super powers I will love to share with the world.

PART 2: How do you solve his #Stata challenges with less than 20 lines?

5:51

PART 2: How do you solve his #Stata challenges with less than 20 lines?

Год назад

PART 1: How do you solve his #Stata challenges with less than 20 lines?

4:59

PART 1: How do you solve his #Stata challenges with less than 20 lines?

Год назад

See what #Stata did with 4,000 Excel files in 2 MINUTES!

15:00

See what #Stata did with 4,000 Excel files in 2 MINUTES!

Год назад

How to download ORIGINAL & LEGAL #Stata Software for less than $50.

4:56

How to download ORIGINAL & LEGAL #Stata Software for less than $50.

Год назад

Getting information from connected datasets

9:37

Getting information from connected datasets

Год назад

WOW! No need to spell variable names FULLY in order to use them with #Stata Commands

2:27

WOW! No need to spell variable names FULLY in order to use them with #Stata Commands

Год назад

This tip will help you QUICKLY locate variables in #Stata Data Editor

2:09

This tip will help you QUICKLY locate variables in #Stata Data Editor

Год назад

Solve this complex data challenge using 7 lines of Stata code

13:45

Solve this complex data challenge using 7 lines of Stata code

Год назад

Final Lesson - Where do we go from here?

8:43

Final Lesson - Where do we go from here?

Год назад

Next Steps for building competence in Data Management

3:36

Next Steps for building competence in Data Management

Год назад

Skills required for Data Management in #Stata

9:04

Skills required for Data Management in #Stata

Год назад

Data Ethics when using #Stata for Data Management

4:05

Data Ethics when using #Stata for Data Management

Год назад

Advanced Tips & Concepts for Data Management in #Stata

5:38

Advanced Tips & Concepts for Data Management in #Stata

Год назад

Understanding Domains & Context in Data Management

9:12

Understanding Domains & Context in Data Management

Год назад

Why do we have errors in #Stata?

14:52

Why do we have errors in #Stata?

Год назад

When cleaning may be impossible in #Stata

3:20

When cleaning may be impossible in #Stata

Год назад

Workflow - Managing folders

5:17

Workflow – Managing folders

Год назад

Workflow Models for monitoring cumulative data collection in #Stata

10:23

Workflow Models for monitoring cumulative data collection in #Stata

Год назад

Strategies and Concepts for data management using #Stata

4:20

Strategies and Concepts for data management using #Stata

Год назад

See how #Stata can resolve this #complex problem in less than 10 lines

17:34

See how #Stata can resolve this #complex problem in less than 10 lines

Год назад

#Stata best practices : Data Documentation

5:36

#Stata best practices : Data Documentation

Год назад

3 Factors to consider when finalizing data in #Stata

7:20

3 Factors to consider when finalizing data in #Stata

Год назад

Renaming Variables in #Stata

10:54

Renaming Variables in #Stata

Год назад

Using #Stata to drop wanted variables and observation during cleaning

1:44

Using #Stata to drop wanted variables and observation during cleaning

Год назад

Adding labels to variables, data and value sets in #Stata

2:04

Adding labels to variables, data and value sets in #Stata

Год назад

These two powerful #Stata commands for cleaning : #recode and #replace

25:56

These two powerful #Stata commands for cleaning : #recode and #replace

Год назад

Basic data cleaning principles in #Stata

10:29

Basic data cleaning principles in #Stata

Год назад

Overview of data cleaning using #Stata

2:57

Overview of data cleaning using #Stata

Год назад

Practical ways of formatting date and time variables in #Stata

23:32

Practical ways of formatting date and time variables in #Stata

Год назад

Комментарии

@aberaaddisu7086 3 месяца назад

So beright thinking

@ritaawotu7354 4 месяца назад

My system is not showing the evaluate stata

@datawithstata 4 месяца назад

Have you clicked the link?

@eliasandersson1522 4 месяца назад

@@datawithstata does not work :/

@jorgeportillo5250 4 месяца назад

Hi. Just to mention that when one is watching the playlist for Module 5, at the end of Lesson 3 it jumps directly to Lesson 5 (skipping Lesson 4). Otherwise, it's great material, keep up the good work.

@datawithstata 4 месяца назад

Thank you for the feedback. Very useful. I am grateful.

@IyasuAsmamaw 5 месяцев назад

Where do you I got serial number to register in stata

@datawithstata 5 месяцев назад

It would be sent to your email. If you filled the form over the weekend, you get a response on Monday

@HarunaUmar-cw7gq 9 месяцев назад

I really got more experience about stamil

@johnomoluabiyoutube 10 месяцев назад

%%time c=0 for i in range(1001,5001): t=pd.read_excel(f"WRA\WRA_{i}.xlsx") c+=t[(t.age>=15) & (t.age<=19)].shape[0] print(f"adolescent_count: {c}") #result adolescent_count: 57185 #time CPU times: total: 1min 2s Wall time: 1min 6s I tried your exact implementation but in python. Wanted to try and see which would be slower, knowing that python is quite slow. Thanks for this sir, I have learnt a lot about stata from your channel.

@rakiyapedru 10 месяцев назад

I got my download details in about an hour. So glad they responded in no time being that it's a Friday. Many thanks for this info. and looking forward to kick-starting my lessons.

@datawithstata 10 месяцев назад

Glad I could help!

@amara8482 Год назад

Nice explanation sir. Do you have a video on analysing the multiple-choice questions in Stata?

@datawithstata Год назад

I do not have any videos. What type of analysis do you have in mind so I can make it?

@amara8482 Год назад

I would appreciate that very much! If each answer to the multiple choice question appear as one column, how do one sort out the information? Considering one person can give multiple answers. Use multiple medications for instance...idk of I explained my confusion well or not.

@datawithstata Год назад

@@amara8482 There are a couple of ways to do this. Please fill this form so I can reach you for further clarification ( forms.gle/FTX479KFf2PtgxWA8 )

@olu-ajayijudah2686 Год назад

Understood

@olu-ajayijudah2686 Год назад

understood

@olu-ajayijudah2686 Год назад

Can we say inferential statistics is akin to a sampled survey?

@olu-ajayijudah2686 Год назад

Understood.

@olu-ajayijudah2686 Год назад

Understood. Thank you

@olu-ajayijudah2686 Год назад

Great lecture again. 4 types of relationships well explained.

@olu-ajayijudah2686 Год назад

Although I wish you talked more on the principle of database

@olu-ajayijudah2686 Год назад

Great insight here. There's a database everywhere is we look closely. Nice one

@olu-ajayijudah2686 Год назад

I am excited to begin this journey

@tshegofatsomogaladi6728 Год назад

My professor passed a license he got from the university to me. He gave me user name and password word and when I try to download there is a serial number field . I do not have the serial number as part of the credential give to me.. Pls advise

@datawithstata Год назад

Hello, many thanks for your enquiries. There is a pdf file your professor was given at the point of purchase. That where you get the serial number

@mohammadenamulhuq9133 Год назад

is it Macbook friendly ?

@datawithstata Год назад

Yes. It is . The only slight difference you will notice is how file paths are referenced.

@promiseoduola5887 Год назад

Hello, Thank you so much. I am done with this beginner's course, It has been an interesting and inspiring journey. Please how will i get the evaluation link.

@kennyputers Год назад

Thank you for sharing, Sir

@prosperp9935 Год назад

can you provide a link to your data files?

@kennyputers Год назад

Good Morning Sir, I'm sharing my output with the links below: dofile (16 lines): drive.google.com/open?id=1PuUzXZmPsOJNyd_5qfBdbtY0La7fcfDG&usp=drive_fs Excel file: docs.google.com/spreadsheets/d/1R1OmBl9E3ehthtjEp1XOka8vCkiqclMrm_VQGNEK98M/edit?usp=sharing My Experience: Spent about 30mins writing the lines. Exporting the Excel file took a while (might be a fault from my laptop though). Many thanks for the opportunity, Sir.

@kennyputers Год назад

Well-done sir. I will share my output shortly

@datawithstata Год назад

Please do!

@otitojuoluwatobilobajoy2113 Год назад

Weldone sir. Please I need a dataset to work with. I am currently in module 2. However there is no data to practically work with

@datawithstata Год назад

We are happy to serve. Many thanks for your feedback . Most of the datasets used can obtained using the *sysuse* command - You may gain access to the datasets via this Dropbox link. www.dropbox.com/scl/fo/71jd6ocstpactkbk9310b/h?rlkey=gvqbupez8nhwaw6k3jzx96nq7&dl=0

@BlamloveGad-nf1ui Год назад

Very impactful and simplified Please can we have the data set...

@datawithstata Год назад

Apologies for delay in response. Use the *_sysuse_* command to get the *auto.dta* Check the description for the link (www.dropbox.com/scl/fo/71jd6ocstpactkbk9310b/h?rlkey=gvqbupez8nhwaw6k3jzx96nq7&dl=0 )

@victorrodriguez5981 Год назад

they gave me 7 days only did set something different to get 30 days, thanks for the video

@datawithstata Год назад

Please I do not understand. Where you given 7 days or 30 days?

@victorrodriguez5981 Год назад

@@datawithstata i was given 7 days, my bad, typo. Did u set something different on the formulary to get 30 days?

@datawithstata Год назад

Before the 7 days expire, ask for more time. State what you are using it for.

@datawithstata Год назад

Hi have you written for the extension.

@CHINEDUEMMANUELODAH Год назад

Thank you for the lesson. It is well explained.

@datawithstata Год назад

Glad it was helpful!

@lizzyigbinaduwa6765 Год назад

Thanks alot for these lectures. Please how do i get the data for the assignment?

@datawithstata Год назад

In the description

@datawithstata Год назад

www.dropbox.com/scl/fo/4stkkb0rr6svix7gd8rtp/h?rlkey=mzj72cvdaesddwzntp32eomyu&dl=0

@pradipamratlal5514 Год назад

Caralho.

@kennyputers Год назад

Thanks for this, boss

@datawithstata Год назад

Any time

@datawithstata Год назад

The original teaser can be found here - ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-T99SlrFmGPY.html

@datawithstata Год назад

The solution to this teaser can be found here..ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-A_8ICvAdGVc.html

@ameniconnecttoallthetestim5831 Год назад

Thank you very much sir, the videos from module1 have been helpful

@datawithstata Год назад

You are most welcome

@nenedavies7923 Год назад

Thank you so much

@datawithstata Год назад

You're most welcome. Let me know if you have any questions or challenges

@nohemibarrios9220 Год назад

I just ask for it, let's hope the approve it. Thanks a lot for the information, dr!

@datawithstata Год назад

I know they will. Please check back late on Monday. You may not get a response over the weekend

@datawithstata Год назад

Any response yet?

@nohemibarrios9220 Год назад

@@datawithstata They already did, thanks!

@datawithstata Год назад

@@nohemibarrios9220 Great! Make sure you maximize the opportunity!

@kayodeabe Год назад

Lesson 2

@datawithstata Год назад

ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-5UQ6j0EI9gM.html

@kennyputers Год назад

Many thanks for sharing this insight sir.

@datawithstata Год назад

Glad it was helpful! I will will be sharing the solution within the week!

@kennyputers Год назад

Using Stata: clear **importing the excel document import excel "C:\Users\DELL\Downloads\Club_teaser.xlsx", sheet("Sheet1") firstrow **formatting the date variables generate date_var = ustrregexra(month,"(.)","$1,") split date_var , generate(Var) parse(",") gen new_month=Var1+Var2+Var3 tostring day year, replace gen date= day + new_month+ year gen new_date= date(date, "DMY") format new_date %td destring year, replace **sorting the data by date sort new_date bysort year: gen bisi=_n **exporting the registration code of the eligible members export excel month day year registeration_code using "C:\Olabisi\eligible_members.xlsx" if bisi==1, sheet("eligible_members") firstrow(variables) sheetreplace datestring("%tc")

@BabatundeKAkano Год назад

Wow! Interesting responses here!

@datawithstata Год назад

Everything is almost perfect here ! See the solution here - ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-A_8ICvAdGVc.html

@datawithstata Год назад

You could have collapsed 3 lines into 1

@kennyputers Год назад

@datawithstata Yes boss... this is noted sir. Thanks for the solution to the teaser

@johnomoluabiyoutube Год назад

import pandas as pd pd.get_option("display.max_rows",None) df=pd.read_excel("/content/drive/MyDrive/Club_teaser.xlsx") df = df.astype(str) df["reg_date"]=pd.to_datetime(df["Year of Registration"]+df["Month of Registration"]+df["Day of Registration"],format='%Y%B%d') selected=df.groupby(["Year of Registration"]).first() print(f"{selected['Registration ID'].count()} members selected") #export to excel selected.to_excel("/content/drive/MyDrive/club_award_winners.xlsx")

@datawithstata Год назад

Have you generated the Excel output? (That line of code is missing )That is the sure way to verify that you are correct

@johnomoluabiyoutube Год назад

@@datawithstata I just edited it. I shared a Google colab with you via email so that you can see the code run

@johnomoluabiyoutube Год назад

No video

@datawithstata Год назад

Hi! I don't understand

@johnomoluabiyoutube Год назад

@@datawithstata sorry, it was my RU-vid app's fault

@johnomoluabiyoutube Год назад

This is one of the first topics you educated me about. It really went a long way, even though I was more of a software developer. It helped me understand database cardinality and constructs of entity relationship diagrams in data modelling

@henryegbelo7864 Год назад

2) Using Python import pandas as pd # Load Dataset - line 1 df = pd.read_excel(r"C:\Users\Henry\Downloads\Club_teaser.xlsx") # Convert variables to string - line 2 df = df.astype(str) # Genearate registration date line 3 df['RegistrationDate'] = pd.to_datetime(df['Day of Registration'].str.split('.').str[0] + ' ' + df['Month of Registration'] + ' ' + df['Year of Registration'].str.split('.').str[0], format='%d %B %Y') # Sort by registration date line 4 df = df.sort_values('RegistrationDate').reset_index() # Select the first 100 registrations - line 5 df.head(100)

@datawithstata Год назад

How do you export to excel?

@henryegbelo7864 Год назад

@@datawithstata # Export the selected registrations to an Excel file first_100_registrations.to_excel(r'C:\Users\Henry\OneDrive\Documents\DATA SCIENCE TUTORIAL\STATA\Club Teaser\selected_registrations.xlsx', index=False)

@henryegbelo7864 Год назад

Please Ignore this first i wasn't thinking...I jumped into your trap sir //Load Dataset import excel "C:\Users\Henry\OneDrive\Documents\DATA SCIENCE TUTORIAL\STATA\Club Teaser\Club_teaser.xlsx", sheet("Sheet1") firstrow clear //Convert variables to string tostring *, replace //Genearate registration date gen RegistrationDate = date(DayofRegistration + " " + MonthofRegistration + " " + YearofRegistration, "DMY") format RegistrationDate %td //Sort by registration date sort RegistrationDate //Select the first 60 registrations bysort Year: keep if _n == 1 keep in 1/60 //E xport the first 60 registration to excel export excel RegistrationID using "C:\Users\Henry\OneDrive\Documents\DATA SCIENCE TUTORIAL\STATA\Club Teaser\Selected RegistrationID_stata3.xlsx", sheetreplace firstrow(variables)

@datawithstata Год назад

@henryegbelo7864 Well done. Everything was perfect _until_ *keep in 1/60*

@datawithstata Год назад

The solution can be found here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-A_8ICvAdGVc.html

@olowonirejuarooluwadunsin9913 Год назад

hello sir, am unable to get the link for the stata download sir

@datawithstata Год назад

ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-3rOLtx1c23Y.html

@datawithstata Год назад

www.stata.com/customer-service/evaluate-stata/

@datawithstata Год назад

Watch the video and fill the form according to instructions

@saundrablakeslee3620 Год назад

P r o m o S M

@ADEMOLAOLATUNDE-k2s Год назад

hello sir, i can not find lesson 8 for module 1..........................ademola olatunde #SOFTRAYS

@ADEMOLAOLATUNDE-k2s Год назад

lesson 8 seen