All I know about Stata - Data Management
I want to share all I know about managing data with Stata. Stata has some superpowers I would love to share with the world.
Why do we have errors in #Stata?
14:52
1 year ago
Workflow - Managing folders
5:17
1 year ago
Renaming Variables in #Stata
10:54
1 year ago
Comments
@aberaaddisu7086 3 months ago
So bright thinking
@ritaawotu7354 4 months ago
My system is not showing Evaluate Stata.
@datawithstata 4 months ago
Have you clicked the link?
@eliasandersson1522 4 months ago
@datawithstata does not work :/
@jorgeportillo5250 4 months ago
Hi. Just to mention that when one is watching the playlist for Module 5, at the end of Lesson 3 it jumps directly to Lesson 5 (skipping Lesson 4). Otherwise, it's great material, keep up the good work.
@datawithstata 4 months ago
Thank you for the feedback. Very useful. I am grateful.
@IyasuAsmamaw 5 months ago
Where do I get the serial number to register Stata?
@datawithstata 5 months ago
It will be sent to your email. If you filled in the form over the weekend, you will get a response on Monday.
@HarunaUmar-cw7gq 9 months ago
I really got more experience about stamil
@johnomoluabiyoutube 10 months ago
    %%time
    import pandas as pd
    c = 0
    for i in range(1001, 5001):
        # raw f-string so the backslash in the Windows path is kept literally
        t = pd.read_excel(rf"WRA\WRA_{i}.xlsx")
        c += t[(t.age >= 15) & (t.age <= 19)].shape[0]
    print(f"adolescent_count: {c}")
    # result
    # adolescent_count: 57185
    # time
    # CPU times: total: 1min 2s
    # Wall time: 1min 6s
I tried your exact implementation, but in Python. I wanted to try and see which would be slower, knowing that Python is quite slow. Thanks for this, sir. I have learnt a lot about Stata from your channel.
@rakiyapedru 10 months ago
I got my download details in about an hour. So glad they responded in no time, given that it's a Friday. Many thanks for this info. Looking forward to kick-starting my lessons.
@datawithstata 10 months ago
Glad I could help!
@amara8482 1 year ago
Nice explanation, sir. Do you have a video on analysing multiple-choice questions in Stata?
@datawithstata 1 year ago
I do not have any videos on that yet. What type of analysis do you have in mind, so I can make one?
@amara8482 1 year ago
I would appreciate that very much! If each answer to the multiple-choice question appears as one column, how does one sort out the information, considering one person can give multiple answers? Using multiple medications, for instance... idk if I explained my confusion well or not.
@datawithstata 1 year ago
@amara8482 There are a couple of ways to do this. Please fill in this form so I can reach you for further clarification (forms.gle/FTX479KFf2PtgxWA8)
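One possible way to approach the question above, sketched in plain Python rather than Stata: it assumes each answer option is stored as its own 0/1 column, and the names `responses`, `med_a`, `med_b` and `med_c` are hypothetical, not taken from the thread. It is only an illustration of tabulating a multiple-response question, not the channel's own method.

```python
from collections import Counter

# Hypothetical survey data: one dict per respondent, one 0/1 entry per
# answer option (1 = the respondent ticked that option).
responses = [
    {"id": 1, "med_a": 1, "med_b": 1, "med_c": 0},
    {"id": 2, "med_a": 0, "med_b": 1, "med_c": 1},
    {"id": 3, "med_a": 1, "med_b": 0, "med_c": 1},
    {"id": 4, "med_a": 0, "med_b": 0, "med_c": 0},
]
option_cols = ["med_a", "med_b", "med_c"]

# Count how many respondents ticked each option.
counts = Counter()
for row in responses:
    for col in option_cols:
        counts[col] += row[col]

# Percent of respondents per option; these can sum to more than 100
# because one person may tick several options.
percent = {col: 100 * counts[col] / len(responses) for col in option_cols}

# Number of options each respondent ticked.
n_selected = {row["id"]: sum(row[col] for col in option_cols) for row in responses}

for col in option_cols:
    print(f"{col}: {counts[col]} respondents ({percent[col]:.0f}%)")
```

The percentages are taken over respondents, not over ticks, which is usually what is wanted when one person can give several answers.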
@olu-ajayijudah2686 1 year ago
Understood
@olu-ajayijudah2686 1 year ago
understood
@olu-ajayijudah2686 1 year ago
Can we say inferential statistics is akin to a sampled survey?
@olu-ajayijudah2686 1 year ago
Understood.
@olu-ajayijudah2686 1 year ago
Understood. Thank you
@olu-ajayijudah2686 1 year ago
Great lecture again. 4 types of relationships well explained.
@olu-ajayijudah2686 1 year ago
Although I wish you talked more on the principle of database
@olu-ajayijudah2686 1 year ago
Great insight here. There's a database everywhere if we look closely. Nice one
@olu-ajayijudah2686 1 year ago
I am excited to begin this journey
@tshegofatsomogaladi6728 1 year ago
My professor passed on to me a license he got from the university. He gave me a username and password, but when I try to download there is a serial number field. I do not have the serial number as part of the credentials given to me. Please advise.
@datawithstata 1 year ago
Hello, many thanks for your enquiry. There is a PDF file your professor was given at the point of purchase. That is where you get the serial number.
@mohammadenamulhuq9133 1 year ago
Is it MacBook-friendly?
@datawithstata 1 year ago
Yes, it is. The only slight difference you will notice is how file paths are referenced.
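To illustrate the path difference mentioned in the reply above, here is a small sketch in Python. The folder layout and the user name `me` are hypothetical and only show how the same file might be referenced on Windows versus macOS.

```python
from pathlib import PurePosixPath, PureWindowsPath

# Hypothetical locations of the same dataset on each operating system.
windows_path = PureWindowsPath(r"C:\Users\me\Documents\stata\auto.dta")
mac_path = PurePosixPath("/Users/me/Documents/stata/auto.dta")

# Windows paths start with a drive letter and use backslashes;
# macOS paths start at "/" and use forward slashes.
print(windows_path.drive)
print(windows_path.parts)
print(mac_path.parts)

# The file name itself is the same on both systems.
same_name = windows_path.name == mac_path.name

# The Windows path rewritten with forward slashes.
windows_as_posix = windows_path.as_posix()
print(windows_as_posix)
```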
@promiseoduola5887 1 year ago
Hello, thank you so much. I am done with this beginner's course. It has been an interesting and inspiring journey. Please, how will I get the evaluation link?
@kennyputers 1 year ago
Thank you for sharing, Sir
@prosperp9935 1 year ago
Can you provide a link to your data files?
@kennyputers 1 year ago
Good morning, sir. I'm sharing my output with the links below:
Do-file (16 lines): drive.google.com/open?id=1PuUzXZmPsOJNyd_5qfBdbtY0La7fcfDG&usp=drive_fs
Excel file: docs.google.com/spreadsheets/d/1R1OmBl9E3ehthtjEp1XOka8vCkiqclMrm_VQGNEK98M/edit?usp=sharing
My experience: I spent about 30 minutes writing the lines. Exporting the Excel file took a while (that might be a fault of my laptop, though). Many thanks for the opportunity, sir.
@kennyputers 1 year ago
Well done, sir. I will share my output shortly
@datawithstata 1 year ago
Please do!
@otitojuoluwatobilobajoy2113
Well done, sir. Please, I need a dataset to work with. I am currently in Module 2. However, there is no data to practise with.
@datawithstata 1 year ago
We are happy to serve. Many thanks for your feedback. Most of the datasets used can be obtained using the *sysuse* command. You may also gain access to the datasets via this Dropbox link: www.dropbox.com/scl/fo/71jd6ocstpactkbk9310b/h?rlkey=gvqbupez8nhwaw6k3jzx96nq7&dl=0
@BlamloveGad-nf1ui 1 year ago
Very impactful and simplified. Please, can we have the dataset?
@datawithstata 1 year ago
Apologies for the delay in response. Use the *_sysuse_* command to get *auto.dta*. Check the description for the link (www.dropbox.com/scl/fo/71jd6ocstpactkbk9310b/h?rlkey=gvqbupez8nhwaw6k3jzx96nq7&dl=0 )
@victorrodriguez5981 1 year ago
they gave me 7 days only did set something different to get 30 days, thanks for the video
@datawithstata 1 year ago
Please, I do not understand. Were you given 7 days or 30 days?
@victorrodriguez5981 1 year ago
@datawithstata I was given 7 days, my bad, typo. Did you set something different on the form to get 30 days?
@datawithstata 1 year ago
Before the 7 days expire, ask for more time. State what you are using it for.
@datawithstata 1 year ago
Hi, have you written to ask for the extension?
@CHINEDUEMMANUELODAH 1 year ago
Thank you for the lesson. It is well explained.
@datawithstata 1 year ago
Glad it was helpful!
@lizzyigbinaduwa6765 1 year ago
Thanks a lot for these lectures. Please, how do I get the data for the assignment?
@datawithstata 1 year ago
In the description
@datawithstata 1 year ago
www.dropbox.com/scl/fo/4stkkb0rr6svix7gd8rtp/h?rlkey=mzj72cvdaesddwzntp32eomyu&dl=0
@pradipamratlal5514 1 year ago
Damn.
@kennyputers 1 year ago
Thanks for this, boss
@datawithstata 1 year ago
Any time
@datawithstata 1 year ago
The original teaser can be found here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-T99SlrFmGPY.html
@datawithstata 1 year ago
The solution to this teaser can be found here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-A_8ICvAdGVc.html
@ameniconnecttoallthetestim5831
Thank you very much, sir. The videos from Module 1 have been helpful
@datawithstata 1 year ago
You are most welcome
@nenedavies7923 1 year ago
Thank you so much
@datawithstata 1 year ago
You're most welcome. Let me know if you have any questions or challenges
@nohemibarrios9220 1 year ago
I just asked for it; let's hope they approve it. Thanks a lot for the information, Dr!
@datawithstata 1 year ago
I know they will. Please check back late on Monday. You may not get a response over the weekend
@datawithstata 1 year ago
Any response yet?
@nohemibarrios9220 1 year ago
@datawithstata They already did, thanks!
@datawithstata 1 year ago
@nohemibarrios9220 Great! Make sure you maximize the opportunity!
@kayodeabe 1 year ago
Lesson 2
@datawithstata 1 year ago
ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-5UQ6j0EI9gM.html
@kennyputers 1 year ago
Many thanks for sharing this insight, sir.
@datawithstata 1 year ago
Glad it was helpful! I will be sharing the solution within the week!
@kennyputers 1 year ago
Using Stata:
    clear
    ** importing the excel document
    import excel "C:\Users\DELL\Downloads\Club_teaser.xlsx", sheet("Sheet1") firstrow
    ** formatting the date variables
    generate date_var = ustrregexra(month,"(.)","$1,")
    split date_var , generate(Var) parse(",")
    gen new_month=Var1+Var2+Var3
    tostring day year, replace
    gen date= day + new_month+ year
    gen new_date= date(date, "DMY")
    format new_date %td
    destring year, replace
    ** sorting the data by date
    sort new_date
    bysort year: gen bisi=_n
    ** exporting the registration code of the eligible members
    export excel month day year registeration_code using "C:\Olabisi\eligible_members.xlsx" if bisi==1, sheet("eligible_members") firstrow(variables) sheetreplace datestring("%tc")
@BabatundeKAkano 1 year ago
Wow! Interesting responses here!
@datawithstata 1 year ago
Everything is almost perfect here! See the solution here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-A_8ICvAdGVc.html
@datawithstata 1 year ago
You could have collapsed 3 lines into 1
@kennyputers 1 year ago
@datawithstata Yes, boss... this is noted, sir. Thanks for the solution to the teaser
@johnomoluabiyoutube 1 year ago
    import pandas as pd
    pd.get_option("display.max_rows",None)
    df=pd.read_excel("/content/drive/MyDrive/Club_teaser.xlsx")
    df = df.astype(str)
    df["reg_date"]=pd.to_datetime(df["Year of Registration"]+df["Month of Registration"]+df["Day of Registration"],format='%Y%B%d')
    selected=df.groupby(["Year of Registration"]).first()
    print(f"{selected['Registration ID'].count()} members selected")
    #export to excel
    selected.to_excel("/content/drive/MyDrive/club_award_winners.xlsx")
@datawithstata 1 year ago
Have you generated the Excel output? (That line of code is missing.) That is the sure way to verify that you are correct
@johnomoluabiyoutube 1 year ago
@datawithstata I just edited it. I shared a Google Colab notebook with you via email so that you can see the code run
@johnomoluabiyoutube 1 year ago
No video
@datawithstata 1 year ago
Hi! I don't understand
@johnomoluabiyoutube 1 year ago
@datawithstata Sorry, it was my YouTube app's fault
@johnomoluabiyoutube 1 year ago
This is one of the first topics you educated me about. It really went a long way, even though I was more of a software developer. It helped me understand database cardinality and the constructs of entity-relationship diagrams in data modelling
@henryegbelo7864 1 year ago
2) Using Python
    import pandas as pd
    # Load dataset - line 1
    df = pd.read_excel(r"C:\Users\Henry\Downloads\Club_teaser.xlsx")
    # Convert variables to string - line 2
    df = df.astype(str)
    # Generate registration date - line 3
    df['RegistrationDate'] = pd.to_datetime(df['Day of Registration'].str.split('.').str[0] + ' ' + df['Month of Registration'] + ' ' + df['Year of Registration'].str.split('.').str[0], format='%d %B %Y')
    # Sort by registration date - line 4
    df = df.sort_values('RegistrationDate').reset_index()
    # Select the first 100 registrations - line 5
    df.head(100)
@datawithstata 1 year ago
How do you export to Excel?
@henryegbelo7864 1 year ago
@datawithstata
    # Export the selected registrations to an Excel file
    first_100_registrations.to_excel(r'C:\Users\Henry\OneDrive\Documents\DATA SCIENCE TUTORIAL\STATA\Club Teaser\selected_registrations.xlsx', index=False)
@henryegbelo7864 1 year ago
Please ignore this first, I wasn't thinking... I jumped into your trap, sir
    // Load dataset
    import excel "C:\Users\Henry\OneDrive\Documents\DATA SCIENCE TUTORIAL\STATA\Club Teaser\Club_teaser.xlsx", sheet("Sheet1") firstrow clear
    // Convert variables to string
    tostring *, replace
    // Generate registration date
    gen RegistrationDate = date(DayofRegistration + " " + MonthofRegistration + " " + YearofRegistration, "DMY")
    format RegistrationDate %td
    // Sort by registration date
    sort RegistrationDate
    // Select the first 60 registrations
    bysort Year: keep if _n == 1
    keep in 1/60
    // Export the first 60 registrations to Excel
    export excel RegistrationID using "C:\Users\Henry\OneDrive\Documents\DATA SCIENCE TUTORIAL\STATA\Club Teaser\Selected RegistrationID_stata3.xlsx", sheetreplace firstrow(variables)
@datawithstata 1 year ago
@henryegbelo7864 Well done. Everything was perfect _until_ *keep in 1/60*
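Reading the Stata attempts in this thread, the task appears to be picking the earliest registrant in each year; that reading is an assumption, since the teaser itself is not quoted here. Under that assumption, a minimal Python sketch of "sort by date, keep the first row per year, and do not trim further" could look like this. The `members` rows are invented sample data, not the contents of Club_teaser.xlsx.

```python
from datetime import datetime

# Hypothetical rows: (registration ID, day, month name, year).
members = [
    ("R001", 14, "March", 2001),
    ("R002", 2, "January", 2001),
    ("R003", 30, "June", 2002),
    ("R004", 5, "June", 2002),
    ("R005", 9, "December", 2003),
]

# Build a real date from the three parts, similar in spirit to
# date(..., "DMY") in the Stata attempts above.
dated = [
    (datetime.strptime(f"{day} {month} {year}", "%d %B %Y").date(), reg_id)
    for reg_id, day, month, year in members
]

# Sort by date, then keep only the first row seen for each year.
# There is no extra "keep the first N rows" step afterwards.
first_per_year = {}
for reg_date, reg_id in sorted(dated):
    first_per_year.setdefault(reg_date.year, reg_id)

print(first_per_year)
```

The number of selected members then equals the number of distinct years in the data, so no fixed cut-off is needed.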
@datawithstata 1 year ago
The solution can be found here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-A_8ICvAdGVc.html
@olowonirejuarooluwadunsin9913
Hello sir, I am unable to get the link for the Stata download
@datawithstata 1 year ago
ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-3rOLtx1c23Y.html
@datawithstata 1 year ago
www.stata.com/customer-service/evaluate-stata/
@datawithstata 1 year ago
Watch the video and fill in the form according to the instructions
@ADEMOLAOLATUNDE-k2s 1 year ago
Hello sir, I cannot find Lesson 8 for Module 1... Ademola Olatunde #SOFTRAYS
@ADEMOLAOLATUNDE-k2s 1 year ago
Lesson 8 seen