- Видео 25
- Просмотров 211 157
Parallel Computing and Scientific Machine Learning
США
Добавлен 1 сен 2020
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource.
github.com/SciML/SciMLBook
Chris Rackauckas, Massachusetts Institute of Technology
Additional information on these topics can be found at:
sciml.ai/ and other Julia programming language sites
Many of these descriptions originated on www.stochasticlifestyle.com/
github.com/SciML/SciMLBook
Chris Rackauckas, Massachusetts Institute of Technology
Additional information on these topics can be found at:
sciml.ai/ and other Julia programming language sites
Many of these descriptions originated on www.stochasticlifestyle.com/
Mixing Differential Equations and Neural Networks for Physics-Informed Learning
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource.
github.com/SciML/SciMLBook
Chris Rackauckas, Massachusetts Institute of Technology
Additional information on these topics can be found at:
sciml.ai/ and other Julia programming language sites
Many of these descriptions originated on www.stochasticlifestyle.com/
github.com/SciML/SciMLBook
Chris Rackauckas, Massachusetts Institute of Technology
Additional information on these topics can be found at:
sciml.ai/ and other Julia programming language sites
Many of these descriptions originated on www.stochasticlifestyle.com/
Просмотров: 6 566
Видео
Uncertainty Programming: Differentiable Programming Extended to Uncertainty Quantification
Просмотров 2,2 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Global Sensitivity Analysis
Просмотров 10 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
From Optimization to Probabilistic Programming
Просмотров 2,9 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Code Profiling and Optimization (in Julia)
Просмотров 4,4 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
GPU Programming in Julia
Просмотров 5 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Parallel Computing: From SIMD to SIMT
Просмотров 5 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Partial Differential Equations (PDEs), Convolutions, and the Mathematics of Locality
Просмотров 2,6 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Differentiable Programming Part 2: Adjoint Derivation for (Neural) ODEs and Nonlinear Solve
Просмотров 3,3 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Differentiable Programming Part 1: Reverse-Mode AD Implementation
Просмотров 3,5 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Basic Parameter Estimation, Reverse-Mode AD, and Inverse Problems
Просмотров 4,5 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Solving Stiff Ordinary Differential Equations
Просмотров 6 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Forward-Mode Automatic Differentiation (AD) via High Dimensional Algebras
Просмотров 7 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Ordinary Differential Equations 2: Discretizations and Stability
Просмотров 3,4 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
Ordinary Differential Equations 1: Applications and Solution Characteristics
Просмотров 4 тыс.4 года назад
In Fall 2020 and Spring 2021, this was MIT's 18.337J/6.338J: Parallel Computing and Scientific Machine Learning course. Now these lectures and notes serve as a standalone book resource. github.com/SciML/SciMLBook Chris Rackauckas, Massachusetts Institute of Technology Additional information on these topics can be found at: sciml.ai/ and other Julia programming language sites Many of these descr...
The Different Flavors of Parallelism: Parallel Programming Models
Просмотров 4,2 тыс.4 года назад
The Different Flavors of Parallelism: Parallel Programming Models
The Basics of Single Node Parallel Computing
Просмотров 7 тыс.4 года назад
The Basics of Single Node Parallel Computing
How Loops Work 2: Computationally-Efficient Discrete Dynamics
Просмотров 3,9 тыс.4 года назад
How Loops Work 2: Computationally-Efficient Discrete Dynamics
How Loops Work 1: An Introduction to the Theory of Discrete Dynamical Systems
Просмотров 7 тыс.4 года назад
How Loops Work 1: An Introduction to the Theory of Discrete Dynamical Systems
Introduction to Scientific Machine Learning 1: Deep Learning as Function Approximation
Просмотров 14 тыс.4 года назад
Introduction to Scientific Machine Learning 1: Deep Learning as Function Approximation
Introduction to Scientific Machine Learning 2: Physics-Informed Neural Networks
Просмотров 18 тыс.4 года назад
Introduction to Scientific Machine Learning 2: Physics-Informed Neural Networks
Optimizing Serial Code in Julia 1: Memory Models, Mutation, and Vectorization
Просмотров 21 тыс.4 года назад
Optimizing Serial Code in Julia 1: Memory Models, Mutation, and Vectorization
Getting Started with Julia (for Experienced Programmers)
Просмотров 29 тыс.4 года назад
Getting Started with Julia (for Experienced Programmers)
Optimizing Serial Code in Julia 2: Type inference, function specialization, and dispatch
Просмотров 9 тыс.4 года назад
Optimizing Serial Code in Julia 2: Type inference, function specialization, and dispatch
Parallel Computing and Scientific Machine Learning Course: Syllabus
Просмотров 28 тыс.4 года назад
Parallel Computing and Scientific Machine Learning Course: Syllabus
That was a rather abrupt end to the lecture, but I think the material was covered well enough.
Amazing, I can't thank you enough.
i got something similar, but this has to work with multi-var calc... so if you define dx as infinitesimal positive, you are saying: dx > 0 and dx*dx -> 0. if you implicitly diff, then you have an operator d[] d[a+b] = d[a] + d[b]. d[a*b] = d[a]*b + a*d[b] then for d[a^b] and d[log_a[b]], the functions are complicated, and don't necessarily handle non-commutative args in general. but if d[a^b] where b is scalar, and d[log_a[b]] where a is scalar, it's easy to define. d[f=a^2 + b^2] = df = 2a da + 2 b db. for df/db: da=0, and db>0, and db*db->0. How does this work on trillion variable calculus?
trying to include epsilon seems to not help ie: is it: dx=DX*epsilon and dy = DY*epsilon f = x^2 + y^2 df = 2x (DX*epsilon) + 2y (DY*epsilon) df/dx = 2x + 2y * DY/DX
I am really enjoying your lectures. What program are you using for the notes.
I think its LyX, if you still need it.
Now that Juno is gone what is the best environment to get all these variable value previews?
Hi Chris, great lectures, thanks to made them public. Which software do you use to write math expressions while presenting the lecture?
So say I have data corresponding to spatiotemporal states like u_i(x,t) where 1<=x<=L and 1<=t<=T and I want some number of iterations of trajectories N so that 1<=i<=N. If I want to store all of the trajectories at every time I should have an array of dimensions size(u) = (N,T,L) so that I can quickly access u_1(x,2) for example. in code it would be u[1,2,:]. ORRR should I reverse the definition so that size(u) = (L,T,N) so u_1(x,2) is given by u[:,2,1]? I'm thinking the latter...
Flashback from my MSc in Physics 12 years ago. Thank you very much for sharing.
Great 💯👍 video 👍📸 thank you 💯💯
what's the name of the program on the left side of the screen?
Amazing video!!! Thank you so much for sharing this. I made my code about x10 times faster just by implementing what Chris says in this first lecture. I didn't know that heap allocation takes so much time. Looking forward to next lectures.
This guy is fantastic, very clear at explaining things. The conclusion was also super encouraging. Thank you.
😱
Glad I chance upon this lecture series because it was very informative! But I can't believe you claim ff8 > ff7 😡
this already super insightful! thank you very much :)
In the "good old days", LaTex was pronounced "lay-techs".
Great explanation on dynamic vs static scheduling! I think that is something that should be part of the manual but it is unfortunately not emphasized enough.
Love the content. Chris Rackauckus is my hero He compared neural nets to Fourier and polynomials bases, but one other basis I wonder about is step functions, which are the basis for boosting. I think those still have the same curse of dimensionality as polynomials and fourier (they don't overcome it like neural nets), but they seem to be fast with a lot of features as well, correct? (I'm thinking xgboost)
What IDE / editor are you using? It looks like it has some useful features.
Atom
Great. I have just found this course. It is very interesting. Just is there any notes or notebooks with this course?
All of the notes are at book.sciml.ai/ with source code github.com/SciML/SciMLBook
Incredible lecture. Thank you for sharing
Dear Sir, how to perform different trajectories at the same time parallely in morris method.. As stated in the primer , P∗ is a k-by-k random permutation matrix in which each row contains one element equal to 1, all others are 0, and no two columns have 1’s in the same position. P∗ gives the order in which factors are moved. How to create a different ordered P* matrix each time for each trajectory
I remember academic discussions in 1994 about how something like a dog catching a frisbee was a neural network solving an ode, which was cool. I think that was when they started engaging on the topic.
I am confused as to why there is a const in front of the a2 array if the values can change. Does this just mean that the size of the array can't change?
Good question! Constant variables can be mutated. They cannot be replaced. Re-assigning is not allowed, and for arrays assignment is the change of reference, not the change of values in the array.
@@scimlorg That clears things up. Thank you! The series has been super helpful btw :)
extremely helpful for freshman like me! 👍
Ugh the slack notification sound gets me every time haha
Wow this is great, very helpful
Are we going to have a newer edition of this course?! Can't wait for it!
More videos will be added soon.
@@scimlorg - hello any update on when more videos will be added - thank you
Thanks for organizing the lectures into a playlist now I know how to follow along the roadmap
0:36 Installing Julia 2:40 Setting up an IDE (Atom or VS Code) 9:35 Activate environments 12:55 First .jl script 13:45 Inline help 18:40 PkgTemplates 23:10 Unicode variables and operators 24:30 import, export, and using 26:20 Documentation and further resources
This is absolutely fantastic. Thanks for sharing.
What editor/IDE are you using?
Thanks Chris for the amazing lecture
Amazing lecture again
julia is really great, especially for people coming from matlab. makes me wonder why alot of industry is still using python.
At 14:11 it says that we can improve the fit by increasing the size of the NN, using more training points, or by training for more iterations. Wouldn't the type of loss function used also have an affect on the performance of the learning algorithm? The first thing I was thinking of is that the sum of squared loss we used isn't exactly the best for use in neural networks. If I recall correctly, sum of squared loss is non-convex with respect to the parameters when using tanh activation functions. So optimization over this loss function is going to give suboptimal results. Maybe this is negligible in the case of this problem but I think that using cross-entropy loss or one of it's derivatives would provide better performance.
what loss we are using here isn't really important. It's the idea of how to use a physical equation (such as Hooke's Law used in the video) in a loss function.
Tks for this!
Please make a playlists for your neural Odes and differential eqns if you can. You have a lot of hidden gems that needs to be organized
Incredible lecture.
LOL 36m50s... can't redefine function g... no problem, just rename it gg.
This video is FANTASTIC. As someone who is currently learning about compilers and LLVM, this video provides a lot of insight on how many cool things compliers can do.
Can you explain to me why an "experienced programmer" needs to be baby walked through setting up an editor? I am "experienced" enough to know what editors I actually like using (for sure not Atom or VS Code lol) and how to figure out how to use them for any programming language I want.
Good for you. I personally prefer the hand holding. I'm pretty sure the 'experienced programmer' is primarily to encourage beginner programmers to look elsewhere if they're just learning to use Julia, or even programming itself. That being said, as a programmer with 'intermediate' experience, I actually don't have a good grasp of many things covered here, like package development, package management, testing, etc.
I think there is some confusion with terminlogies that deserves clarification. Broadcasting, vectorization are being interchanged. Where I come from from (deep learning), broadcasting refers to extending an operation to other dimensions of one of two operands that don't have same shapes. Example: a has shape [4, 1] B has shape [4, 10] Then, broadcasting *allows* adding / multiplying such objects. The rationale is: *a* is a vector, *B* is an array of vectors. The addition is to happen between *a* and *columns* of B resulting in a matrix that looks like *B* with all its columns being equal to *a* + original column value. Vectorization: a function designed to work on scalars, give me a version of it that works fast on vectors by feeding the vectors directly instead of looping over their elements and calling the function just as many times. They might be similar at core, but I have noticed that in some specific fields, the disinction is due.
Isn't vectorisation just a special case of broadcasting? In which case the usage seems correct to me.
@@torstenschenkel2110 After a second thought, I see you are right, or may be it is the other way around. I claim this because I see that vectorization seem to be something that you explicitly ask for via "." notation, while broadcasting happens spontaneously whenever the dimensions are condusive. This is the case Julia, Numpy, TensorFlow, Pytorch.
Thanks for the lecture! By the way I don't quite understand the speed differences between push! and append!. The output is the same, but I suppose the backstage process is different. Regards!
Really good lecture! I have only the question of why the "Stochastic dynamical model" isn't modeled more directly as a Markov Chain?
Well, with an AR(1) process, u_{n+1} depends only on u_{n} so it satisfies markov property. But with high order autoregressive processes AR(N), u_{n+1} will depend on u_{n}, ..., u{n+1-N}. So I believe that it may not satisfy the Markov Property. I could be wrong though :P
What is the name of the software being used for typing math equations?
Wondering exactly the same!
It is LyX, a simplified LaTeX editor
How is julia for making highly concurrent web apps?
excellent explanation
It's important to highlight that KL Divergence is NOT the EMT (Earth movers distance) / OT and its NOT a distance. The EMT analogy is confusing because you are using a commonly employed analogy for EMT and applying it to KL which can implicitly confuse the two concepts for a viewer. There is a relationship between the two, argmin_{p(x,y) \in D(pi_0, pi_1)} KL (p(x,y) || N(x,y, s * I )) as s ->0 this KL becomes the EMT between distributions pi_0 and pi_1 using a squared error cost (Wasserstein distance). Where D(pi_0, pi_1) is the set of distributions p(x,y) with pi_0(x) and pi_1(y) as marginals.
Hi! I don't understand what kind of differential equations can be solved with this iterative process, for example in this case you use the Lorenz System but it can be consider a general approach for any differential equation?
Hi!, What I think is that we are trying to approach to a general Dynamical System (like a Markov process). Then for example any system with mutating states can be thought like that