European Bioinformatics Institute - EMBL-EBI

European Bioinformatics Institute - EMBL-EBI

336
586 583

Подписаться

We manage the world’s public biological data and make it freely available to the scientific community, provide professional training in bioinformatics, and perform computational biology research. Here you can find videos about our institute and many of our services, including UniProt, Ensembl, PDBe and Reactome.

AlphaMissense predictions for human genetic variation research

1:14:02

AlphaMissense predictions for human genetic variation research

2 месяца назад

Accessing sequence analysis tools via the new Job Dispatcher website

37:48

Accessing sequence analysis tools via the new Job Dispatcher website

3 месяца назад

Exploring human disease and protein variant data in UniProtKB

36:49

Exploring human disease and protein variant data in UniProtKB

3 месяца назад

Wine yeasts as a model system in community ecology

52:52

Wine yeasts as a model system in community ecology

3 месяца назад

Harmony in diversity Exploring the rich microbiomes of South American wildlife

58:15

Harmony in diversity Exploring the rich microbiomes of South American wildlife

3 месяца назад

Introduction to the Ensembl Bacteria genome browser

54:01

Introduction to the Ensembl Bacteria genome browser

4 месяца назад

Statistical thinking for microbial ecology

59:25

Statistical thinking for microbial ecology

4 месяца назад

Curating proteins involved in Antimicrobial Resistance AMR in UniProt

46:49

Curating proteins involved in Antimicrobial Resistance AMR in UniProt

4 месяца назад

Explore Human Pangenome Reference Consortium HPRC data in Ensembl

52:06

Explore Human Pangenome Reference Consortium HPRC data in Ensembl

4 месяца назад

The Pathogens Portal A gateway to vast biomolecular data for pathogen research

40:39

The Pathogens Portal A gateway to vast biomolecular data for pathogen research

4 месяца назад

Microbial biodiversity at schools and its link with children’s health

43:20

Microbial biodiversity at schools and its link with children’s health

5 месяцев назад

Strain resolved approaches for human microbiome studies

57:27

Strain resolved approaches for human microbiome studies

5 месяцев назад

Bioinformatic approaches to understand the role of the human microbiome in health and disease

55:33

Bioinformatic approaches to understand the role of the human microbiome in health and disease

5 месяцев назад

UniProt for Proteomics Scientists

53:22

UniProt for Proteomics Scientists

5 месяцев назад

Exploring metagenomes to assess microbiomes across the globe

56:00

Exploring metagenomes to assess microbiomes across the globe

5 месяцев назад

Investigating microbial ecology in extreme habitats

54:53

Investigating microbial ecology in extreme habitats

6 месяцев назад

Sequence analysis tools in UniProt BLAST and Align

41:13

Sequence analysis tools in UniProt BLAST and Align

6 месяцев назад

Exploring target-disease associations with the Open Targets Platform’s new interface

50:09

Exploring target-disease associations with the Open Targets Platform’s new interface

6 месяцев назад

EMBL-EBI: the home of big data in biology

2:31

EMBL-EBI: the home of big data in biology

6 месяцев назад

Automated annotation in UniProt

37:04

Automated annotation in UniProt

7 месяцев назад

An introduction to EMBL EBI resources

52:15

An introduction to EMBL EBI resources

7 месяцев назад

Uncovering protein function with UniProt

46:38

Uncovering protein function with UniProt

8 месяцев назад

Studying metabolites and small molecules with MetaboLights and ChEBI

37:28

Studying metabolites and small molecules with MetaboLights and ChEBI

8 месяцев назад

Accessing single cell transcriptome data with the Human Cell Atlas Data Portal

39:54

Accessing single cell transcriptome data with the Human Cell Atlas Data Portal

9 месяцев назад

Programmatic access to UniProt using Python

1:04:48

Programmatic access to UniProt using Python

9 месяцев назад

Complex Portal from proteins to complexes

39:46

Complex Portal from proteins to complexes

9 месяцев назад

Exploring protein families and domains using InterPro

33:48

Exploring protein families and domains using InterPro

9 месяцев назад

How to make your research open with Europe PMC

36:02

How to make your research open with Europe PMC

9 месяцев назад

Exploring protein sequences and functional annotations with UniProt

53:29

Exploring protein sequences and functional annotations with UniProt

9 месяцев назад

Комментарии

@duafatima6283 Месяц назад

Hi, thank you for the video. Which proteins are more reliable to analyze out of reviewed or unreviewed sets?

@EMBL-EBI Месяц назад

The reviewed dataset (UniProt/SwissProt) is a high quality manually annotated and non-redundant protein sequence database, which brings together experimental results, computed features and scientific conclusions. It contains protein sequences with evidence at the protein level. In SwissProt, each protein has been manually curated by expert curators based on: -Experiments described in peer-reviewed literature -Sequence and homology analysis The unreviewed dataset (UniProtKB/TrEMBL) contains many more sequences from various genome sequencing projects. TrEMBL contains high quality computationally analyzed records that are enriched with automatic annotation and classification. Sequences have not been manually reviewed by a curator and do not contain experimental annotations from literature. Annotations are based on automatic annotation systems that learn from SwissProt entries, such as UniRule and ARBA. Sequences may not have evidence at the protein level and some sequences may be incomplete (labeled as fragments). Ultimately, the choice between these datasets depends on the user's specific needs. Both the experimental-based annotations in SwissProt and the automatic annotation system in TrEMBL are considered reliable sources for protein feature annotations. SwissProt prioritizes accuracy and experimental validation, while TrEMBL offers a much larger dataset generated through automated methods.

@onesimemb102 Месяц назад

The very relevant aspects for visualization, thank you for this training.

@temitayoogundimu6294 Месяц назад

This is insightful.

@acrocent9788 2 месяца назад

Even though a vcf can easily be above 50 mb, ensembl only keeps a 50 mb limit when using their vep, is there another platform that takes vcfs for pathogenicity analysis which can take more import data?

@EnsemblHelpdesk 2 месяца назад

Hi, thank you for the query. Ensembl VEP also recognises compressed (gzipped) input files. Alternatively, you can provide a URL to the file location if your input file is bigger than 50MB in size, and Ensembl VEP is also available via the REST API and the command-line. I hope that this helps!

@omarmziouka4072 2 месяца назад

hello ! please How can I find motifs of a protein on UniProt ?

@EMBL-EBI 2 месяца назад

The 'Family and Domains' section of a UniProt entry provides information on sequence similarities with other proteins and the domain(s) present in a protein. The information is filed in different subsections, such as domain, repeat region, coiled coil and motif. These protein features can also be visualized in the Feature Viewer of a protein entry. The feature viewer allows to see all sequence features together in a visual manner. Features are arranged into categories such as domains and sites, motifs, molecule processing, post-translational modifications, mutagenesis, etc. The ruler on top represents the sequence length of the protein. By clicking on a feature, a tooltip will be shown with information on the feature and also highlight the sequence position of the feature. We hope this information is helpful for you.

@vondhanaramesh4365 3 месяца назад

Could you please let me know the detailed tutorial of chembl API

@EMBL-EBI 3 месяца назад

Hi there, thanks for your comments and interest in ChEMBL. Can you please email your query to chembl-help@ebi.ac.uk, where we can open a helpdesk ticket for you and share it with the team.

@vondhanaramesh4365 3 месяца назад

Currently I'm working with schisostoma mansoni, thank you so much!

@SidwellMafisa 3 месяца назад

I would like to cancel this nonsense as I didn't apply for it

@EMBL-EBI 3 месяца назад

Hi there. We're unsure what you want to cancel here? Our webinar videos are uploaded for free to RU-vid after recording. Perhaps you get alerts whenever we upload videos? If that is what you didn't want to see anymore, you will need to check into your individual RU-vid settings as this isn't something we as an account can control. I hope that helps.

@alpdinc-oran6684 4 месяца назад

great explanation. thank you for posting

@puitea_ralte 4 месяца назад

Thanks for sharing. I am Computer Science background, am doing PhD title on "An automated decision-making system to identify T2DM (type - II diabetes Miletus ) based on DNA sequences". i got data from which i have to figure it out diabetic variant from genomic dataset and i am stuck with it. i will be extremely glad if you can provides me some help in my research. Can you please drop some thing to contact you.

@student_remo 4 месяца назад

Thank you for this series! ❤

@student_remo 4 месяца назад

SRSF1 gene = Serine and Arginine Rich Splicing Factor 1.

@student_remo 4 месяца назад

MT-CO1 gene = Mitochondrially Encoded Cytochrome C Oxidase I.

@student_remo 4 месяца назад

I love the explanation of endosymbiont theory here. 🤍

@student_remo 4 месяца назад

“This gene encodes an enzyme involved in blood pressure regulation and electrolyte balance. It catalyzes the conversion of angiotensin I into a physiologically active peptide angiotensin II. Angiotensin II is a potent vasopressor and aldosterone-stimulating peptide that controls blood pressure and fluid-electrolyte balance. This angiotensin converting enzyme (ACE) also inactivates the vasodilator protein, bradykinin.” - National Institutes of Health, USA.

@student_remo 4 месяца назад

IL6 = interleukin 6. From “leukocyte” and the Greek language, leuk- “white”, cyt- “cell”.

@student_remo 4 месяца назад

“Fat mass and obesity associated (FTO) was the first gene found to be associated with obesity in three independent genome-wide association studies.” -NIH USA Gene full name: FTO alpha-ketoglutarate dependent dioxygenase.

@student_remo 4 месяца назад

“Cystic fibrosis is an inherited disease caused by mutations in a gene called the cystic fibrosis transmembrane conductance regulator (CFTR).” - National Institutes of Health, USA

@student_remo 4 месяца назад

18S rRNA = 18S ribosomal RNA.

@student_remo 4 месяца назад

“The TP53 gene provides instructions for making a protein called tumor protein p53 (or p53). This protein acts as a tumor suppressor, which means that it regulates cell division by keeping cells from growing and dividing (proliferating) too fast or in an uncontrolled way.” -MedlinePlus Genetics

@student_remo 4 месяца назад

XIST gene: X inactive specific transcript.

@student_remo 4 месяца назад

TTN is so big, what if we renamed “titin” into “titan”? 😅

@student_remo 4 месяца назад

HBB: Hemoglobin subunit beta gene. 🌬🩸

@student_remo 4 месяца назад

00:59 A gene is “a region of genome that makes a particular protein or functional RNA.”

@muhammednagas6311 5 месяцев назад

Was useful. Many thanks

@chidozienwanedo9234 5 месяцев назад

Nice presentation, Alex. Although I have got to work on coming to terms with the novel techniques you talked about. especially the use of drep on MAG. All the same, it was interesting to learn from you. Thanks!

@goodwork3980 6 месяцев назад

Great vidéo from Julia 🥰

@guihuajia7696 7 месяцев назад

My targets are not from CHEMBL but in other sources with their identifiers. How can I convert those identifiers of hundreds of targets into CHEMBL IDs?

@guihuajia7696 7 месяцев назад

pchembl_value__gte=5? under the threshold of: less than 10 um of potency? pchembl = - log10(10) =-1, if potence is great than 10 um ( ie. < 10 um), the pchembl should be >= -1. Right?

@guihuajia7696 7 месяцев назад

I got it wrong, the unit should be in molar concentration. then the cutoff is: pchembl_value__gte=5.

@MaxHumbertoCautiQuilcaro 7 месяцев назад

🎯 Key Takeaways for quick navigation: 00:00 🌱 *Dave Edwards, Director of the Center for Applied Bioinformatics at the University of Western Australia, discusses the intersection of pangenomics and machine learning for crop improvement.* 03:36 🌍 *The changing climate and growing global population are impacting agriculture. Shifts in rainfall patterns and temperature changes are affecting crop productivity, especially in food-insecure regions.* 05:42 🧬 *Genomics is crucial for improving crop productivity. Major crops need yield improvements and adaptation to climate change, while minor crops important for food security have great potential for improvement.* 08:58 🧬 *Sequencing technology has advanced significantly, becoming cheap and accessible. Next-generation sequencing and technologies like Oxford Nanopore and PacBio Sequel allow for cost-effective sequencing of diverse genomes.* 13:57 🌾 *Pangenomics involves understanding core genomes, variable genes, and dispensable genes in a species. A single reference genome doesn't represent the diversity, necessitating a pangenomic approach.* 16:51 🧩 *Building pan genomes involves an iterative assembly approach, utilizing a reference genome, mapping reads, assembling new contigs, and iteratively adding more data. Population graphs are now favored for their ability to capture more genomic information.* 18:29 📊 *Population graphs, especially in plant species, allow mapping data from hundreds or thousands of individuals to study genomic variation. They provide a comprehensive view of relationships between different parts of the genome.* 19:54 🧬 *Explored genomic diversity in Brassica species using pan-genomics, revealing significant variation in gene presence/absence.* 23:31 🧬 *Modeled genome sequencing to predict the number of genes in Brassica rapa, demonstrating the efficiency of capturing most genes with a relatively small number of individuals.* 25:51 🌱 *Identified disease resistance genes showing presence/absence variation in Brassica species, suggesting potential sources for crop improvement.* 26:19 🌾 *Explored Brassica napus (canola) pan-genome, highlighting substantial gene variation and the impact of polyploidy on gene redundancy.* 28:15 🤖 *Applied machine learning to understand gene loss mechanisms in Brassica species, revealing variable factors like chromosome position and homologous exchange.* 32:52 🌾 *Investigated wheat (bread wheat) pan-genome, emphasizing the limitations of using a single reference and the importance of pan-genomes for more accurate genomic studies.* 35:30 🌱 *Analyzed a soybean pan-genome with over a thousand individuals, uncovering gene frequency changes during domestication and breeding.* 37:39 🧬 *Explored reduction in gene content during domestication and breeding, indicating potential deleterious genes with no presence/absence variation that may be targeted using genome editing technologies.* 39:21 🌐 *Discussed the need for improved graph pan-genomes, data accessibility, and integration of diverse genomic information for more comprehensive analyses.* 40:06 🌾 *Machine learning can be applied to diverse data types in crop improvement, including crop images, genome sequences, and tabular data like yield statistics.* 41:29 🧠 *Multimodal deep learning involves building individual models for different data types (genomic variation, phenotype, environmental data) and combining them for predictions, allowing easier modification and fine-tuning.* 42:11 🌽 *Successful example: Using machine learning for yield prediction in Maize by analyzing drone images and manipulating them through rotation and other techniques.* 44:30 📊 *Classifying high-yielding lines early in crop development using machine learning, even without weather data, proves useful for breeders.* 45:13 🧬 *Machine learning and deep learning show promise in predicting traits in crops, with an example in soybean resequencing and the identification of important genomic loci.* 46:20 🌱 *Machine learning models, particularly XG Boost, aid in predicting gene content in canola, even for genes that are challenging to predict due to masking effects.* 47:04 🌾 *Quantitative disease resistance, such as blackleg in canola, can be predicted based on genotype, demonstrating the potential for machine learning in challenging scenarios.* 47:49 🔄 *Ongoing challenges and future directions include the need for better annotated pan-genome graphs, improved technology for building computational-efficient graphs, and the development of more advanced machine learning models for diverse data types.* 48:57 💻 *Collaborating with breeding companies and optimizing the path to breeding improved crops using bioinformatics is essential, emphasizing the importance of more data accessibility and usability.* 49:25 🌍 *Acknowledgment of the urgency in addressing climate change impacts on agriculture, highlighting the need for continuous innovation and collaboration in crop improvement efforts.* Made with HARPA AI

@keeperscoffin 7 месяцев назад

Terrific information! Thank you.

@bahaddinahmad5823 8 месяцев назад

Dream of working there?

@ibtissammaslouh3540 8 месяцев назад

Is ttn gene really fatal for newborns especially ?

8 месяцев назад

Thank you so much

@kruthiiirao 8 месяцев назад

👍

@TheLuikartLab 8 месяцев назад

Is there a good tutorial on how to analyze/visualize .raw files for people without proteomic experience?

@DeeptiJaiswal23 8 месяцев назад

Unfortunately we do not have any tutorial on visualisation of RAW files.

@marwatawfik3956 8 месяцев назад

Anyway to get technical support to get my 16S datasets? i find that difficult to follow.

@ArjunSingh-mv4es 9 месяцев назад

Can we annotate our fungus whole genome sequence here

@EMBL-EBI 9 месяцев назад

Thank you for the comment. Rfam can be used to annotate all fungal genomes, and we have some documentation here: docs.rfam.org/en/latest/genome-annotation.html, but you will have to run everything locally. We hope that helps!

@marwatawfik3956 9 месяцев назад

How to upload my 16S dataset?

@thesuprememat7119 9 месяцев назад

good video

@marijager700 9 месяцев назад

Thanks for the presentation! I've been doing GO and other functional analysis of proteomic data for several years, and still came by several useful tips and tricks in this video :).

@EMBL-EBI 9 месяцев назад

That's great to hear, thanks for sharing!

@jayasuriyajm3029 9 месяцев назад

@18.27 isn't specific parents gives to broader children, since arrow is pointing towards biological processes?

@bbarry083 9 месяцев назад

Greetings from Ethiopia 🇪🇹

@user-ru7rc6om1x 10 месяцев назад

unclear accent

@kellihendrix9637 10 месяцев назад

💘 "Promo SM"

@keenviewer 11 месяцев назад

Very informative - thank you. I have had success the conda package for VEP: conda create -n VEP109 conda activate VEP109 conda install ensembl-vep=109.3 (latest at time of installation) conda install perl-compress-raw-zlib=2.202 An additional step was required (suggested during installation of the above) to install cache data. Here I installed human GCRh38: vep_install -a cf -s homo_sapiens -y GRCh38 -c ~/.conda/envs/VEP109/ ~/.conda/envs/VEP109/GRCh38/ --CONVERT --PLUGINS all

11 месяцев назад

Thank you so much. Greetings from Molecular Biology, Environment and Cancer Research Group at Universidad del Cauca, Colombia.

11 месяцев назад

Thank you EMBL - EBI for this useful video. Greetings from a bioeng graduate student.