pvierth
I am an assistant professor of Chinese Studies and an affiliate in Data Science at William & Mary in Virginia and a former assistant professor of the Digital Humanities at Leiden University in the Netherlands (though this channel is not affiliated with either university). I am a scholar of late imperial Chinese literature and book history. On this channel I'll be posting videos related to the digital humanities, text mining, and general fun stuff!
A Quick Introduction to QGIS
18:34
4 years ago
Network Analysis with Python
13:38
4 years ago
Network Analysis with Gephi
18:28
4 years ago
Word Embedding Models in Python
20:21
4 years ago
Topic Modeling: Introduction
8:29
4 years ago
Explaining an art heist (badly)
0:40
5 years ago
Episode 2: The Command Prompt
7:36
5 years ago
Episode 16: Corpus Representation
31:16
5 years ago
Episode 15: Pandas
30:12
5 years ago
Episode 21: APIs and Webscraping
24:37
5 years ago
Episode 20: Topic modeling
34:13
5 years ago
Comments
@transcribemusic 3 months ago
This is amazing, man! I'm a musicologist and I'm currently working on my PhD project, investigating travelogues from the 16th to 19th century for intertextual references with regard to descriptions of musical encounters between colonialists and Indigenous people from the Americas. I would love to incorporate computational methods into my PhD, so your videos REALLY come in handy. Do you plan on continuing this series or is there a way of contacting you somehow? I would really love to get in touch and talk with you about how you would approach my research topic, especially since I'm very new to Python and programming.
@MCPetruk 10 months ago
So after inputting all this data and creating networks... did he find out who the author was?
@richardholmes5676 1 year ago
Stylometry research from the Berkeley group found only a 1 in 15 trillion chance of the Book of Mormon being a fabrication.
@tom66222 1 year ago
Thank you! This is a great video. Super great starting guidance for me.
@helloafrica13 1 year ago
Important! As of March 2021, the LdaMallet model has been removed from Gensim, which no longer supports wrappers.
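The removal noted above is what produces the "no attribute 'wrappers'" errors reported elsewhere in these comments. A minimal sketch of checking which situation an installation is in, using only the standard library; `has_module` is a hypothetical helper name, not part of gensim:

```python
import importlib.util


def has_module(dotted_name):
    """Return True if the module can be located (hypothetical helper)."""
    try:
        return importlib.util.find_spec(dotted_name) is not None
    except (ImportError, ValueError):
        # Raised when a parent package (for example gensim itself) is missing.
        return False


if has_module("gensim.models.wrappers"):
    print("The wrappers package is present; LdaMallet can be imported.")
elif has_module("gensim"):
    print("gensim is installed, but it has no wrappers package.")
else:
    print("gensim is not installed.")
```

If the wrappers turn out to be missing, the options usually suggested are installing an older 3.x release of gensim or switching to gensim's built-in LdaModel; neither is confirmed in this thread.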
@edwardisoghie3746 1 year ago
Please, I need help. I have installed Java and set the path environment variable to MALLET_HOME.
@edwardisoghie3746 1 year ago
This is the error I am getting while implementing Mallet: AttributeError: module 'gensim.models' has no attribute 'wrappers'
@ZiluZheng 1 year ago
《金瓶梅》😂
@coquequilotrouge4502 1 year ago
Hi, it's really interesting, thank you! I'm trying to do the same, but the articles that I use are PDF files, and it doesn't work with them. What format are the documents that you stored and used for the word embedding?
@MatyGti12 2 years ago
Hello, thanks for the tutorial, I thought that weight would mean the size of the circle for each name. Greetings
@300deQI 2 years ago
nice, thanks
@wasiahmad8642 2 years ago
Hi there, would you please link your paper here? I want to submit a project on intertextuality, but I would like to know a lot more before I submit it to my professor. It would be great if I could read the paper. TIA
@pvierth 2 years ago
Here you go: culturalanalytics.org/article/11054-a-blast-based-language-agnostic-text-reuse-algorithm-with-a-markus-implementation-and-sequence-alignment-optimized-for-large-chinese-corpora
@zongzheli1283 2 years ago
Subscribed. He is A- Ma- Zing. Like I confirmed, amazing.
@lottaek1202 2 years ago
Your channel is a blessing. Just starting out in Digital Humanities, you have amazing and super helpful content.
@mrb85 2 years ago
Noice
@PutriUnawat 2 years ago
Thanks for this video. It's very helpful :) I have a problem with my code. I have set the mallet path, but I get AttributeError: module 'gensim.models' has no attribute 'wrappers'. I need to use from gensim.models.wrappers import LdaMallet. I hope you can help with my problem. Thanks
@rangerstunl3463 2 years ago
Thank you very much for this tutorial. As a Gephi beginner, I found that this gave a very calm and collected yet detailed insight.
@erinlane4233 2 years ago
Thank you for all of these videos! They are wonderful!
@erinlane4233 2 years ago
This is a fantastic explanation of Topic Modeling. Thank you so much!
@guldariyabisenbay2769 2 years ago
Hello, thank you for the informational video tutorial. Could you please send the link where we could find this dataset's .csv file?
@mahmood92456 2 years ago
thanks... very helpful
@joeleone6276 2 years ago
I learned more useful info about file manipulation in the first three minutes of this video than a whole semester class in college. Amazing!
@jackp4180 2 years ago
Hello Pvierth, I ran into a problem when I run G.add_node; it says: IndexError: string index out of range. Do you know how I should fix it?
@AMFLearning 2 years ago
nice amflearning by doing
@dimanhassan637 3 years ago
Thanks a lot, but could you please make more videos about network analysis, especially co-authorship networks, using Python and networkx, of course?
@kaoutarlanjri7412 3 years ago
Very well explained! Thank you very much
@dreamdipesh 3 years ago
Thanks a lot for this video. It was super helpful for my work on finding relationships between various materials. Keep up the good work.
@martinwutke3386 3 years ago
Very good and straightforward introduction to networkx. Thanks for taking the time to record this video.
@marisamahilum1662 3 years ago
Hi. I'm getting an error after running the Python script.
Traceback (most recent call last):
  File "tm.py", line 38, in <module>
    lda_model = gensim.models.wrappers.ldamallet.LdaMallet(
  File "C:\Users\mmb\anaconda3\lib\site-packages\gensim\models\wrappers\ldamallet.py", line 126, in __init__
    self.train(corpus)
  File "C:\Users\mmb\anaconda3\lib\site-packages\gensim\models\wrappers\ldamallet.py", line 279, in train
    self.word_topics = self.load_word_topics()
  File "C:\Users\mmb\anaconda3\lib\site-packages\gensim\models\wrappers\ldamallet.py", line 337, in load_word_topics
    with utils.smart_open(self.fstate()) as fin:
  File "C:\Users\mmb\anaconda3\lib\site-packages\smart_open\smart_open_lib.py", line 138, in smart_open
    return file_smart_open(parsed_uri.uri_path, mode)
  File "C:\Users\mmb\anaconda3\lib\site-packages\smart_open\smart_open_lib.py", line 642, in file_smart_open
    return compression_wrapper(open(fname, mode), fname, mode)
  File "C:\Users\mmb\anaconda3\lib\site-packages\smart_open\smart_open_lib.py", line 630, in compression_wrapper
    return make_closing(GzipFile)(file_obj, mode)
  File "C:\Users\mmb\anaconda3\lib\gzip.py", line 173, in __init__
    fileobj = self.myfileobj = builtins.open(filename, mode or 'rb')
TypeError: expected str, bytes or os.PathLike object, not _io.BufferedReader
@Aruksa023 3 years ago
Great video! It helped me a lot, especially to wrap my head around QGIS after 2 years of not using any GIS software and barely knowing how to use ArcGIS xd. Thanks a lot😉 Your explanations were super clear and straightforward🙂
@letconsultingpartnermexico6390 3 years ago
Can you share your dataset so that we can practice?
@kaoutarlanjri7412 3 years ago
I think he has everything on his GitHub account.
@guldariyabisenbay2769 2 years ago
Hello, were you able to find this dataset? The one on GitHub is not complete; it is an edited version. Just wondering whether you found it and could send the link.
@guldariyabisenbay2769 2 years ago
@kaoutarlanjri7412 Hello, were you able to find this dataset? The one on GitHub is not complete; it is an edited version. Just wondering whether you found it and could send the link.
@leopoldoe.torres6444 2 years ago
No, I couldn't find the dataset :/ However, send me an email and I can share some practice datasets with you :)
@guldariyabisenbay2769 2 years ago
@leopoldoe.torres6444 I found another dataset and was able to practice with it, but thanks a lot for your reply! Have a good day :)
@daveamiana778 3 years ago
Very clear presentation of ideas. Thanks for introducing this tool.
@luizaluiiza 3 years ago
Hey, Paul, everything works, thank you a lot for your explanations and huge work!
@luizaluiiza 3 years ago
Dear Paul, could you please tell us whether it is possible to sort the documents' names on the x-axis alphabetically?
@luizaluiiza 3 years ago
Hey, Paul! Thanks a lot for your video, it's very helpful! I'm a Mac user and I put mallet_path = "./bin/mallet" on line 35 instead of your code, but I still get the error: "subprocess.CalledProcessError: Command './bin/mallet import-file --preserve-case --keep-sequence --remove-stopwords --token-regex "\S+" --input fed_corpus.txt --output fed_corpus.mallet' returned non-zero exit status 127". Do you have any ideas about how I could fix that? I don't understand what exactly is wrong. I would be very thankful!
@pvierth 3 years ago
This suggests to me that gensim is not finding the mallet executable! Double-check that the path to mallet is correct. As you currently have it, the path implies that the bin folder is in the same directory as your code. I usually set it up so that it stays in the folder it was downloaded in (mallet-2.0.8). In that case your mallet path should be "./mallet-2.0.8/bin/mallet".
@luizaluiiza 3 years ago
@pvierth It worked!! Thank you 1000 times! Your videos are such a great help :)
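Exit status 127 is how a shell reports that a command could not be found, which fits the diagnosis in the reply above. A minimal sketch of checking the path before handing it to gensim, using only the standard library; `find_mallet` is a hypothetical helper name, not part of gensim or MALLET:

```python
import os
import shutil


def find_mallet(mallet_path):
    """Return an absolute path to a runnable MALLET launcher, or None.

    find_mallet is a hypothetical helper, not part of gensim or MALLET.
    """
    candidate = os.path.abspath(os.path.expanduser(mallet_path))
    if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
        return candidate
    # Otherwise look for the bare command name on the PATH.
    return shutil.which(os.path.basename(mallet_path))


# Path taken from the reply above; adjust it to wherever MALLET was unpacked.
mallet_path = "./mallet-2.0.8/bin/mallet"
resolved = find_mallet(mallet_path)
if resolved is None:
    print("MALLET not found at", os.path.abspath(mallet_path))
else:
    print("MALLET launcher:", resolved)
```

Printing the absolute path makes it easier to see which directory a relative path such as "./bin/mallet" is actually being resolved against.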
@weslleym.3594 3 years ago
Great, great video! Thanks!
@jeremiahhorstick8041 3 years ago
Thanks for this video!
@tomwongcy 3 years ago
Hi, if my data are points and I need to keep the two points that overlap, how can I do that? Thanks
@Diedeledie 3 years ago
I just want to leave this comment in case someone else runs into the same issue: using the string 'C:\mallet\bin\mallet\' to specify the path to mallet didn't work for me because of a Unicode error. This is because Python treats a backslash as the start of an escape sequence; \u, for example, is a Unicode escape and fails unless it is followed by hexadecimal digits. Doubling each backslash should fix this (so, 'C:\\mallet\\bin\\mallet').
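A short sketch of the escaping issue described above, using the comment's own example path. It also shows a raw string, which is a standard Python alternative that the comment does not mention:

```python
# Two equivalent ways to write the Windows path from the comment above.
escaped = "C:\\mallet\\bin\\mallet"   # every backslash doubled
raw = r"C:\mallet\bin\mallet"         # raw string: backslashes stay literal

print(escaped == raw)

# Doubling only the first backslash is not enough: "\b" is the escape
# sequence for a backspace character, so this is a different string.
broken = "C:\\mallet\bin\\mallet"
print(broken == raw)
print(repr(broken))
```

A path under C:\Users is a common trigger for the Unicode error, because in a normal string \U begins an eight-digit Unicode escape.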
@kintag4459 3 years ago
Thank you M.r
@loudelacruz2501 3 years ago
Wondering why you separate the edges and nodes datasets.
@josephniehaus9082 3 years ago
Incredibly helpful. Thank you so much
@DoctaFabulous 3 years ago
This video was really helpful. I feel more comfortable diving into QGIS now.
@sumayaalazree673 3 years ago
Can the text be represented by a semantic network?
@p_272 3 years ago
Hello, and thanks for these videos! I want to ask, did episode 2 disappear? I watched episode 1 but I could not find episode 2 on the video list.
@pvierth 3 years ago
How odd! It is showing up in the playlist for me, but here is a direct link: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-fhsH4ua9zP8.html
@p_272 3 years ago
@pvierth I think it is because I was looking at 'all videos' on the channel page. Now I see it, thanks for the playlist link!
@Aspamir 3 years ago
I appreciate your courses and the way you go about teaching. However, I wish one of the episodes so far had covered how to open files from our local computers. This is perhaps a very basic thing, but most people would want to immediately open their own files on their computers and play around.
@ariadnimak5622 3 years ago
Could we use pandas to load the data?
@josephaghatise873 3 years ago
You did great work on this basic video. Well done!
@eltayebelhag6450 3 years ago
How can I change the encoding of the graph, such as the color and maybe the node size?