
Documenting Design Patterns #891

Open · MikeInnes opened this issue Oct 11, 2019 · 17 comments

@MikeInnes (Member) commented Oct 11, 2019

A lot of what Flux can do is not explicitly written down. Regularisation is a good example: just grabbing your parameters and summing them is really simple and intuitive, but if you're used to frameworks that provide an explicit API for this, you might not think of it, or assume it's not supported at all because it isn't in the docs.
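For instance, a minimal sketch of L2-style regularisation as plain Julia code (the layer sizes and the 0.01 weight are arbitrary choices for illustration):

```julia
using Flux, LinearAlgebra

m = Chain(Dense(10, 5, relu), Dense(5, 2))

# The penalty is just an ordinary function over the collected parameters.
penalty() = sum(norm, Flux.params(m))

# Add it to whatever loss you're using.
loss(x, y) = Flux.mse(m(x), y) + 0.01 * penalty()
```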

So we need to document Flux "design patterns" that explicitly cover features from other frameworks. Some things off the top of my head:

Ideas (or requests) for other features whose usage we should document are welcome.

@DhairyaLGandhi (Member):

Writing custom adjoints and the intuition behind them. Maybe even a section on the semantics of differentiating arbitrary code (find, handling workers, etc.).
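For concreteness, a minimal sketch of a custom adjoint using Zygote's `@adjoint` macro (`mysquare` is a made-up example function):

```julia
using Zygote
using Zygote: @adjoint

mysquare(x) = x^2

# An adjoint returns the primal result plus a pullback that maps the
# output sensitivity x̄ back to input sensitivities.
@adjoint mysquare(x) = mysquare(x), x̄ -> (2x * x̄,)

gradient(mysquare, 3.0)  # (6.0,)
```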

@MikeInnes (Member, Author):

Agreed, we definitely need to replace the backprop section that used to cover Tracker. We don't want to duplicate all the Zygote docs, but some explanation in the context of Flux would be really helpful.
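As a baseline for such a section, taking gradients with respect to a model's parameters looks something like this (a sketch; the `W` field name is a Dense internal at the time of writing):

```julia
using Flux

m = Dense(3, 2)
x = rand(Float32, 3)

# Implicit-parameter style: the result is keyed by the parameter arrays.
gs = gradient(() -> sum(m(x)), Flux.params(m))
gs[m.W]  # gradient with respect to the weight matrix
```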

@jumerckx commented Oct 12, 2019

I'd love some explanation of mutation in a model using Buffer.

@scheidan (Contributor):

An example of gradient clipping would be good too.

@MikeInnes (Member, Author):

@scheidan I think we should add some clipping layers (#672), but yes, giving people the know-how to do it themselves more generally is of course also a good idea.
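In the meantime, a minimal sketch of doing it by hand (the 0.5 threshold and the optimiser are arbitrary):

```julia
using Flux

m = Dense(10, 2)
x, y = rand(Float32, 10), rand(Float32, 2)

gs = gradient(() -> Flux.mse(m(x), y), Flux.params(m))

# Clamp every gradient entry elementwise before the update.
for p in Flux.params(m)
  gs[p] .= clamp.(gs[p], -0.5f0, 0.5f0)
end

Flux.Optimise.update!(Descent(0.1), Flux.params(m), gs)
```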

@merckxiaan Is there anything Flux-specific we should say about Buffer? It's probably worth at least mentioning, along with a few other more advanced tricks that are documented by Zygote.

@jumerckx:

I haven't got anything specific in mind, but it'd be interesting to see a simple deep learning network that uses Buffers.
I'm sure Buffers are really helpful for building efficient models, but I still don't really get how anything can be achieved when mathematical operations aren't permitted on them.

@MikeInnes (Member, Author):

Buffers are really just meant as a workaround when you want to do array construction using mutation. So you might use them inside the definition of a basic array op like cat, but they'd be completely transient; you wouldn't pass Buffers through a deep learning model.
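A toy example of that pattern (`mycat` is made up for illustration; the Buffer is filled by mutation, then frozen with `copy` before it escapes):

```julia
using Zygote
using Zygote: Buffer

function mycat(xs...)
  buf = Buffer(xs[1], sum(length, xs))  # transient, write-friendly workspace
  i = 0
  for x in xs, v in x
    buf[i += 1] = v
  end
  return copy(buf)  # freeze into an ordinary, differentiable array
end

gradient(x -> sum(mycat(x, x)), [1.0, 2.0])  # ([2.0, 2.0],)
```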

@janEbert (Contributor):

> Advanced Gradients (link Zygote docs)

Adding to that, not tracking parameters should be included there (or somewhere else) as well.
You mentioned Zygote.dropgrad.
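Something like this (a sketch of `dropgrad`, assuming the current Zygote API):

```julia
using Zygote

# `y` is hidden from differentiation, so its gradient comes back as `nothing`.
f(x, y) = x * Zygote.dropgrad(y)

gradient(f, 2.0, 3.0)  # (3.0, nothing)
```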

@MikeInnes (Member, Author):

Yes, that definitely also fits under "things we should have APIs for" too; added to the list.

@appleparan commented Nov 9, 2019

How about writing a tutorial on porting code from the TensorFlow or PyTorch docs? A lot of ML code out there is written for TF or Torch, and I often found it confusing to convert. For me, the input data structure is the most unfamiliar part compared to other frameworks. Without the model zoo, it's hard to use Flux.jl from the docs alone. I think one-to-one comparisons of other frameworks' code with Flux would be helpful.

@MikeInnes (Member, Author):

Good idea. I think we could add "Flux for TF users" or "Flux for PyTorch users" guides as sections in the docs. Happy to help anyone who wants to contribute that.
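A sketch of what one such side-by-side entry might look like (layer sizes arbitrary; the Keras line is paraphrased in a comment):

```julia
using Flux

# Keras:  model = Sequential([Dense(64, activation="relu"), Dense(10)])
# Flux equivalent; note that input sizes are explicit:
model = Chain(
  Dense(784, 64, relu),
  Dense(64, 10),
)
```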

@RohitMazumder:

If no one is already working on it, I would love to contribute to "Flux for TF users"!

@MikeInnes (Member, Author):

That would be great!

@cossio (Contributor) commented Jan 13, 2020

Automatic parameter extraction for custom types, via @functor.

E.g., add this example from @dhairyagandhi96 to the docs, which shows how you can control which fields are added to the parameter tree:

```julia
julia> using Flux

julia> using Flux: @functor

julia> struct MyLayer{T,K}
         a::T
         b::K
       end

julia> @functor MyLayer (a,)  # only the `a` field becomes a trainable parameter

julia> _l = MyLayer(rand(3, 3), rand(5));

julia> size.(Flux.params(_l))  # `b`, the length-5 vector, is excluded
1-element Array{Tuple{Int64,Int64},1}:
 (3, 3)
```

@DhairyaLGandhi (Member):

Adding transfer learning examples would be good too.
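For example, a minimal sketch of the fine-tuning pattern (the backbone here is a stand-in; in practice it might come from Metalhead.jl):

```julia
using Flux

backbone = Chain(Dense(100, 50, relu), Dense(50, 25, relu))  # "pretrained" part
model = Chain(backbone, Dense(25, 2))                        # fresh task head

# Freezing is just parameter selection: train only the head's parameters.
ps = Flux.params(model[2])
loss(x, y) = Flux.mse(model(x), y)
data = [(rand(Float32, 100), rand(Float32, 2))]
Flux.train!(loss, ps, data, Descent(0.1))
```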

@DhairyaLGandhi (Member):

Data loading + visualisation, and TensorBoardLogger.jl integration.
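A minimal sketch of the logging side, following TensorBoardLogger.jl's documented pattern (the loss values are stand-ins):

```julia
using TensorBoardLogger, Logging

lg = TBLogger("runs/example")  # view with: tensorboard --logdir runs

with_logger(lg) do
  for step in 1:100
    loss = exp(-step / 30)   # stand-in for a real training loss
    @info "train" loss = loss
  end
end
```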

@ToucheSir (Member):

Thoughts on breaking this into smaller issues and creating a project board for them? I know some (e.g. RNNs, ecosystem) have been addressed already.
