Did you find value in this End-to-end tutorial in Bioinformatics/Cheminformatics? If you would like more videos like this please give it a 👍Like and ❤️Subscribe to the channel. Please comment down below your thoughts and suggestions 👇
@@Kelrash31 Hi Alain, it's been slow but consistent. I've been working on QSAR and Docking at the moment. Still haven get that much into scripting for data managing but it's in the future plans.
@@DataProfessor Hi I am enthusiastic to learn QSAR and MD simulation for Protein Degraders. Do you recommend particular blog or any Book to learn data science for drug design/optimization. Your lectures/explanation are fantastic, getting enlightened all the pros/cros about CADD. Thank you very much
That's awesome Michael! Speaking of drug discovery, I have a several more videos covering the topic here, you are more than welcome to check them out bit.ly/dataprofessor-bioinformatics
i m a student in india i have so much interest in bioinformatics and i found u thank u so much.... I m currently studying to get in a college for bioinfo undergrad
@@DataProfessor Great Video! What books you recommend me? I am going to start my PhD in Drug Discovery. Moreover, if you have githubs or colabs, all is perfecto for me. Thanks!
I already subscribe to your chanel. First time I found something like this after very long time searching for this kind of videos. I'm very pleased with the information on this video. Congratulations 👏🎊🎉 dear Data professor.I want to become data scientist on the field of drugs discovery and design, can you advise me where to start and which book should I reed first from the list below. Thank you.
Hi, thanks for the comment. This channel got you covered, make sure to go through this Bioinformatics playlist of 17 videos (more to come) that includes theory and practice (step-by-step) to get you started in doing bioinformatics projects. bit.ly/dataprofessor-bioinformatics
I would like to suggest a video using RDKit for conformational search and energy minimization. I'm just starting my bioinformatic project and this video was really helpfull! Thanks!
awsome series.......could you please make a video on extracting important information of drugs like chemical structure, target proteins, sideeffects from different databases...thankyou
Hey, I'm a pharmacy graduate and I'm new to data science, we're working on solubility prediction, your video seems to be useful to estimate the aq.sol of compounds. We have few new compounds whose solubility needs to be estimated. If you can tell me how to use the above method for new compounds it will be more helpful to us.
Hi, I've made a video on how to build a solubility prediction web app, you can check it out here ruclips.net/video/iZUH1qlgnys/видео.html A demo of this web app is also provided in the video description.
How can this be used to do DIY, at home drug discovery? What I mean by this is, does this only have academic value or can you apply the results to something you can easily get without an university degree or comparable credentials? As you might know, a small company started selling kits that allow you to genetically modify bacteria and frogs with crispr. I am looking for something along the lines.
The computational model can definitely be built by anyone if following the step-by-step tutorials. As for bringing the discovered knowledge to the next step, you may need to collaborate with many people (chemists, other biologists, FDA officials/ clinical trials, etc.) Bringing a drug to market is a billion-dollar endeavor that involves many people/organizations.
@@DataProfessor Not everyone lives in the united states. That being said, I conclude that this only has academic value. Stop and think for a moment about exceptions. Which experiments can you conduct that come as close as possible to real drug design without the need for paid chemists, other biologists, clinical trials?
@@vrschwrngsthrtkr22 Thanks for the discussion. I agree, that it takes a lot of resources for bringing a drug to market. Actually, much of the budget for carrying drug discovery and development are from big pharmaceutical companies while academia accounts for a minor portion.
@@DataProfessor Which experiments can you conduct that come as close as possible to real drug design without the need for paid chemists, other biologists, clinical trials?
You can use pandas apply function to simplify a lot of the computation involving the ratio between aromaticity and heavy atoms. if you define a function def foo(row): ## compute aromaticity by heavy atom for a single row ## row contains the fields for a given row in your pd dataframe then do df["aromatic_to_heavy"] = df.apply(foo) I think with pandas its best to use their in-built functions for iterations to save computational power
Thank you for your videos they are amazing!! Is it possible to provide us with some tips theoretical or practical knowledge on the data science for bioinformatics? e.g which are the most used data-preprocessing steps, feature selection steps and models, validation modes and on which bioinformatic problems can we use them?
I think you can, DNN is used to build the model, you’ll have to decide which input are you using, e.g. SMILES, chemical structure image, descriptors, fingerprints, etc.
i am a doctor in organic chemistry and macromolecular , I want to predicted the solubility of polymers using delanay predicted , p^lease can you give me the idea or the step that I should do it in the first I want to make experience and compared with this can you help me
Do you mean for this dataset? yes, you can also use that here. This tutorial reproduces the research published by Dalaney and so also used linear regression to match the approach that they used.
Thanks, professor. It is a really good tutorial. Can I use this method to predict the solubility of one chemical in a liquid solution (or lipid solubility)? If not , is there any suggestions or tools I can use for lipid solubility prediction? Thank you very much.
Hi prof, thank you for posting and sharing knowledge. My background is computer science and I am interested this topic about data science or AI driven in drug discovery since last year especially about drug interaction. However along the way, I found some difficulties regarding to understand this biological data, process and terms. Could you give me some tips to thrive in this field? I really want this field a to be my primary research topic in master degree
Hi Lia, thanks for sharing your interest in computational drug discovery. I've written some review articles on the topic that may provide some introductory viewpoints to the field. www.tandfonline.com/doi/full/10.1517/17460441.2015.1016497 www.researchgate.net/publication/338639486_Best_Practices_for_Constructing_Reproducible_QSAR_Models A more complete list is at www.researchgate.net/profile/Chanin_Nantasenamat/research
@@DataProfessor hi professor, thank you so much for this. I am sorry for the late reply. I am starting to follow your video learning so I can understand both coding and biology data at the same time. Hopefully, we can collaborate in academic research in the future :). Keep spreading the knowledge, you are a great tutor
Hello sir, how to calculate drug pairwise similarity based on chemical structure fingerprint corresponding to 881 chemical structures defined in PubChem database?
@@DataProfessor thanks a lot ...is there any video of yours on that as i am new to this field .i have to start from scratch...any suggestions will be highly appreciated
Thanks for watching! There are larger datasets available on chemical databases such as ChEMBL, PubChem, BindingDB, etc. which can be used as external datasets to the dataset used in this video.
The answer really depends on the type of work that you would like to do. PhD is not necessary to become a data scientist working in the field of biotechnology/ bioinformatics. A PhD is necessary if you want to become a principal investigator and lead a research group (probably applicable more to academia, and maybe industry)
Satvik, coincidentally, it is in the making, I have already drafted a notebook but will film the video soon, please stay tuned. Please turn on the notifications so that you will be notified as soon as a new video comes out. Thanks for your suggestion!
Thanks Miroslava for the kind comment. The link to the notebook code for all videos in this channel is in the video descriptions of all videos. For this video, the link is github.com/dataprofessor/code/blob/master/python/cheminformatics_predicting_solubility.ipynb
Excellent videos by the Data Professor. Feel free to read the following blog paper on Medium website “Apply Machine Learning Algorithms for Genomics Data Classification”. This will help you to understand how to apply Machine Learning algorithms for genomic data classification. This blog paper contains the latest ML/AI technologies applied to human genomic data classification today.
Thanks for the comment, owing to compatability issues, the code provided in the header part of the code on the provided DataProfessor GitHub works optimally. Please kindly refer to the provided code link in the video description.
It's totally not my place to say, and I know the professor must have thought a lot about his "lisp", but I believe he can maser the "s" sound if he focuses on how his teeth touch the tongue when pronouncing an open "eeeeeeeeeee" sound and slowly raising the tip of his tongue; AT SOME POINT STOP VIBRATING your throat (while making the "eee" sound) and just allow air to pass the tube created by the tongue and slowly raise the tip of your tongue to the roof of your mouth (it doesn't need to TOUCH the roof) without blocking the entire passageway for the air. ALSO for him specifically he might try to pull back the tongue (keeping contact with the same portion of the roof of the moutn) and use a more forward portion of the tip of the tongue. Air should NOT leave anywhere but from the front of the tongue
Did you find value in this End-to-end tutorial in Bioinformatics/Cheminformatics? If you would like more videos like this please give it a 👍Like and ❤️Subscribe to the channel. Please comment down below your thoughts and suggestions 👇
I'm just starting with python, I'm an organic chemist looking to change my field from wet lab to comp chem, this is gold for me, thank you very much.
Thanks Franco for the comment. I'm planning on making more of these data science for drug discovery videos.
@@Kelrash31 Hi Alain, it's been slow but consistent. I've been working on QSAR and Docking at the moment. Still haven get that much into scripting for data managing but it's in the future plans.
@@DataProfessor Hi I am enthusiastic to learn QSAR and MD simulation for Protein Degraders. Do you recommend particular blog or any Book to learn data science for drug design/optimization. Your lectures/explanation are fantastic, getting enlightened all the pros/cros about CADD. Thank you very much
I'm presently working on drug discovery with deep neural networks and I found this tutorial very helpful.
That's awesome Michael! Speaking of drug discovery, I have a several more videos covering the topic here, you are more than welcome to check them out bit.ly/dataprofessor-bioinformatics
i m a student in india i have so much interest in bioinformatics and i found u thank u so much.... I m currently studying to get in a college for bioinfo undergrad
It's my pleasure, welcome to the channel and welcome to bioinformatics 😊
Just loving the content you make.
Thanks Shweta for the kind comment!
I was waiting for this one, thank u 😁
Thanks Marco for the comment! Glad to hear that!
@@DataProfessor Great Video! What books you recommend me? I am going to start my PhD in Drug Discovery. Moreover, if you have githubs or colabs, all is perfecto for me. Thanks!
are these descriptors any better than molecular fingerprint ?
I already subscribe to your chanel. First time I found something like this after very long time searching for this kind of videos. I'm very pleased with the information on this video. Congratulations 👏🎊🎉 dear Data professor.I want to become data scientist on the field of drugs discovery and design, can you advise me where to start and which book should I reed first from the list below. Thank you.
That's great, kindly make a whole series for drug discovery pipeline using ML/DL methods, Thanks you
Hi, thanks for the comment. This channel got you covered, make sure to go through this Bioinformatics playlist of 17 videos (more to come) that includes theory and practice (step-by-step) to get you started in doing bioinformatics projects. bit.ly/dataprofessor-bioinformatics
@@DataProfessor thank you so much Professor
I would like to suggest a video using RDKit for conformational search and energy minimization. I'm just starting my bioinformatic project and this video was really helpfull! Thanks!
awsome series.......could you please make a video on extracting important information of drugs like chemical structure, target proteins, sideeffects from different databases...thankyou
Hello Mam
Wich field you belong..?
Hey, I'm a pharmacy graduate and I'm new to data science, we're working on solubility prediction, your video seems to be useful to estimate the aq.sol of compounds. We have few new compounds whose solubility needs to be estimated. If you can tell me how to use the above method for new compounds it will be more helpful to us.
Hi, I've made a video on how to build a solubility prediction web app, you can check it out here ruclips.net/video/iZUH1qlgnys/видео.html
A demo of this web app is also provided in the video description.
How can this be used to do DIY, at home drug discovery? What I mean by this is, does this only have academic value or can you apply the results to something you can easily get without an university degree or comparable credentials? As you might know, a small company started selling kits that allow you to genetically modify bacteria and frogs with crispr. I am looking for something along the lines.
So?
The computational model can definitely be built by anyone if following the step-by-step tutorials. As for bringing the discovered knowledge to the next step, you may need to collaborate with many people (chemists, other biologists, FDA officials/ clinical trials, etc.) Bringing a drug to market is a billion-dollar endeavor that involves many people/organizations.
@@DataProfessor Not everyone lives in the united states. That being said, I conclude that this only has academic value.
Stop and think for a moment about exceptions. Which experiments can you conduct that come as close as possible to real drug design without the need for paid chemists, other biologists, clinical trials?
@@vrschwrngsthrtkr22 Thanks for the discussion. I agree, that it takes a lot of resources for bringing a drug to market. Actually, much of the budget for carrying drug discovery and development are from big pharmaceutical companies while academia accounts for a minor portion.
@@DataProfessor Which experiments can you conduct that come as close as possible to real drug design without the need for paid chemists, other biologists, clinical trials?
Precise and clear.Thank you so much!
Thanks for watching and for the kind words 😁
You can use pandas apply function to simplify a lot of the computation involving the ratio between aromaticity and heavy atoms.
if you define a function
def foo(row):
## compute aromaticity by heavy atom for a single row
## row contains the fields for a given row in your pd dataframe
then do
df["aromatic_to_heavy"] = df.apply(foo)
I think with pandas its best to use their in-built functions for iterations to save computational power
Can you help me with finding how to counts the number of atom in compound with rdkit?I wanna use'non-carbon proportion' feature in my model!
Yes, you can use the .GetNumAtoms() function on the molecule object. More details provided here www.rdkit.org/docs/GettingStartedInPython.html
Thanks. It is very great learning process via your video
You are most welcome
I have completed my biomedical engineering...and currents working in clinical SAS will I be able to Switc job in this domain ?
thank you professor please share more about drug discovery
Thanks Samet, my pleasure, please stay tuned by hitting the notification bell 😃
Thank you for your videos they are amazing!! Is it possible to provide us with some tips theoretical or practical knowledge on the data science for bioinformatics? e.g which are the most used data-preprocessing steps, feature selection steps and models, validation modes and on which bioinformatic problems can we use them?
Thanks Michalis for the suggestion! I’ll put this excellent idea into the to-do list for future videos.
i'm try predicted drung FDA approved using pharmacophore modelling, please as can i use DNN method for this approach.
I think you can, DNN is used to build the model, you’ll have to decide which input are you using, e.g. SMILES, chemical structure image, descriptors, fingerprints, etc.
i am a doctor in organic chemistry and macromolecular , I want to predicted the solubility of polymers using delanay predicted , p^lease can you give me the idea or the step that I should do it in the first
I want to make experience and compared with this
can you help me
Will there be usage of other ensemble algorithms like adaboost and GB ?
Do you mean for this dataset? yes, you can also use that here. This tutorial reproduces the research published by Dalaney and so also used linear regression to match the approach that they used.
@@DataProfessor thank u professor for the response.
Could you also give some thoughts on how this model helps in future in drug development process ?
Thanks, professor. It is a really good tutorial. Can I use this method to predict the solubility of one chemical in a liquid solution (or lipid solubility)? If not , is there any suggestions or tools I can use for lipid solubility prediction? Thank you very much.
Amazing 👌🙌👌 insightful
Thanks for watching!
@Data Professor, many thanks
Thank you
Hi prof, thank you for posting and sharing knowledge. My background is computer science and I am interested this topic about data science or AI driven in drug discovery since last year especially about drug interaction. However along the way, I found some difficulties regarding to understand this biological data, process and terms. Could you give me some tips to thrive in this field? I really want this field a
to be my primary research topic in master degree
Hi Lia, thanks for sharing your interest in computational drug discovery. I've written some review articles on the topic that may provide some introductory viewpoints to the field.
www.tandfonline.com/doi/full/10.1517/17460441.2015.1016497
www.researchgate.net/publication/338639486_Best_Practices_for_Constructing_Reproducible_QSAR_Models
A more complete list is at www.researchgate.net/profile/Chanin_Nantasenamat/research
@@DataProfessor hi professor, thank you so much for this. I am sorry for the late reply. I am starting to follow your video learning so I can understand both coding and biology data at the same time. Hopefully, we can collaborate in academic research in the future :). Keep spreading the knowledge, you are a great tutor
Hello sir, how to calculate drug pairwise similarity based on chemical structure fingerprint corresponding to 881 chemical structures defined in PubChem database?
Hi the pairwise molecular similarity can be computed using the Tanimoto coefficient, I think rdkit allows to do that.
@@DataProfessor thanks a lot ...is there any video of yours on that as i am new to this field .i have to start from scratch...any suggestions will be highly appreciated
Very useful.
Where to find good data for training?
Thanks for watching! There are larger datasets available on chemical databases such as ChEMBL, PubChem, BindingDB, etc. which can be used as external datasets to the dataset used in this video.
Perfect
Does PhD is necessary for becoming a data scientist in the field like biotechnology or bioinformatics?
The answer really depends on the type of work that you would like to do. PhD is not necessary to become a data scientist working in the field of biotechnology/ bioinformatics. A PhD is necessary if you want to become a principal investigator and lead a research group (probably applicable more to academia, and maybe industry)
Can you please make a simple tutorial for gromacs on colab.
Satvik, coincidentally, it is in the making, I have already drafted a notebook but will film the video soon, please stay tuned. Please turn on the notifications so that you will be notified as soon as a new video comes out. Thanks for your suggestion!
love love ,nice course。
Thank you 👍
Could you share the notebook? Very nice tutorial.
Thanks Miroslava for the kind comment. The link to the notebook code for all videos in this channel is in the video descriptions of all videos. For this video, the link is github.com/dataprofessor/code/blob/master/python/cheminformatics_predicting_solubility.ipynb
Awesome
Excellent videos by the Data Professor. Feel free to read the following blog paper on Medium website “Apply Machine Learning
Algorithms for Genomics Data Classification”. This will help you to understand how to apply Machine Learning algorithms for
genomic data classification. This blog paper contains the latest ML/AI technologies applied to human genomic data classification today.
Could you please tell me how to get rdkit in python2 colab notebook?
Thanks for the comment, owing to compatability issues, the code provided in the header part of the code on the provided DataProfessor GitHub works optimally. Please kindly refer to the provided code link in the video description.
Great exercise! I am also a Data Scientist but my videos are in spanish in case there's anyone interested!😁😁
Muy bien! Awesome channel Ana!
It's totally not my place to say, and I know the professor must have thought a lot about his "lisp", but I believe he can maser the "s" sound if he focuses on how his teeth touch the tongue when pronouncing an open "eeeeeeeeeee" sound and slowly raising the tip of his tongue; AT SOME POINT STOP VIBRATING your throat (while making the "eee" sound) and just allow air to pass the tube created by the tongue and slowly raise the tip of your tongue to the roof of your mouth (it doesn't need to TOUCH the roof) without blocking the entire passageway for the air. ALSO for him specifically he might try to pull back the tongue (keeping contact with the same portion of the roof of the moutn) and use a more forward portion of the tip of the tongue. Air should NOT leave anywhere but from the front of the tongue
I'm totally lost with the aromatic atoms part
Hi Kris, I have written a complementary article on Medium explaining this at link.medium.com/8OB3NXKwo9