numberOfAtoms to n_atoms and others #376

hainm · 2015-08-04T03:55:20Z

guys,

as @orbeckst suggested from this issue #372, I am opening this issue to raise discussiong whether mdanalysis should change its behavior, like changing numberOfAtoms method to n_atoms property?

My point is quite simple: if your method is not expensive to evaluate, change it to attribute, which is much shorter (numberOfAtoms vs n_atoms). And The method's name should be shortened (as shown). The point is, we (including mdanalaysis) should follow some common patterns in Python's world to make users have smooth transition from one to another package. There is nothing wrong if "our" package is similar to others, a package still have some features that others don't have (for example, mdanalysis is quite good at membrane analysis (vs pytraj)).

PS: I myself wrote my own package (pytraj), so I am not benefit from the changing (:D) but mdanalysis.

My suggestion is too look at attributes from mdtraj package. It has pretty good naming stuff (one of pytraj's developers always suggesting using mdtraj in 1st place, so I think there must be a reason).

The text was updated successfully, but these errors were encountered:

hainm · 2015-08-04T04:00:06Z

Just for example.

this is the code from mdtraj

import mdtraj as md
traj = md.load(filename, topname)

and this is from pytraj

import pytraj as pt
traj = pt.iterload(filename, topname)

there is nothing wrong with having similar API. The main point is to make users feel comfortable with the interface. There is seamless conversion from mdtraj to pytraj. This helps when mdtraj or pytraj need helps from another package.

traj = pt.Trajectory(xyz=m_traj.xyz, top=topology_name)

richardjgowers · 2015-08-04T08:09:34Z

With n_atoms, numberOfAtoms etc, I'm in favour of getting rid of all of them and instead using len. AtomGroup, ResidueGroup and SegmentGroup are just containers, Python containers use len. Easiest to remember imo.

ag.numberOfAtoms() == len(ag)
ag.numberOfResidues() == len(ag.residues)

orbeckst · 2015-08-04T17:35:55Z

On 4 Aug, 2015, at 01:09, Richard Gowers wrote:

With n_atoms, numberOfAtoms etc, I'm in favour of getting rid of all of them and instead using len. AtomGroup, ResidueGroup and SegmentGroup are just containers, Python containers use len. Easiest to remember imo.

ag.numberOfAtoms() == len(ag)
ag.numberOfResidues() == len(ag.residues)

There's, however, something to be said for easy introspection. If I do

AtomGroup.n<TAB>

and get

AtomGroup.numatoms

I know immediately what it does.

I would definitely want to have len(AtomGroup) work, too. (And here I am actually disagreeing with the Zen of Python that there should be always only one way to do things... especially with overloading generic mechanisms it is not always clear what the domain-specific meaning is.)

I favor deprecating numberOfAtoms(), numberOfResidues(), numberOfSegments() and replacing them with managed properties numatoms, numresidues, numsegments (because we already have trajectory.numframes)... or if people like other variants like num_atoms or natoms or n_atoms better then we should deprecate numframes and turn it into n_frames.

(Note that I don't think "fast to type" is the biggest argument here --- clarity is more important for complicated and central data structures such as AtomGroup --- if anything, this is the object that most users will have contact with and that is probably the most confusing. Part of this and related issues is to make it behave more the way that an unsuspecting user would reasonably expect it to behave.)

orbeckst · 2015-08-04T17:42:35Z

On 3 Aug, 2015, at 21:00, Hai Nguyen wrote:

Just for example.

this is the code from mdtraj

import mdtraj as md
traj = md.load(filename, topname)
and this is from pytraj

import pytraj as pt
traj = pt.iterload(filename, topname)
there is nothing wrong with having similar API. The main point is to make users feel comfortable with the interface.

Although this is a separate issue, we could add a compatibility module along the lines of

import MDAnalysis.masquerade as mdtraj
u = mdtraj.load(trajectory, topology)

masquerade.load() would need to do some careful argument juggling to always get the topology (something like args[-1]) and do some sanity checks but at least this way we would have a non-intrusive way of accommodating alternative interfaces.

However, such a feature would very much depend on user demand and, well, user contributions, i.e. pull requests by the like of @hainm :-). (At the moment, the core devs have a lot to do with the most pressing other issues.)

orbeckst · 2015-08-04T23:14:20Z

Just saw that Timestep also uses numatoms (#250) so this should also be consistent with whatever else we decide.

dotsdl · 2015-08-06T18:30:46Z

I'm in favor of n_atoms, n_residues, n_segments, as well as trajectory.n_frames. Although I think numatoms and friends are easier to say (n_atoms feels a bit weird, but that might just be me), I think n_... is easier to read. I just spent a couple minutes staring at both sets of variants to make up my mind.

hainm · 2015-08-06T18:36:10Z

and they are (n_atoms, n_...) are shorters to write code (we're not always use ipython with <TAB>)

dotsdl · 2015-08-06T18:40:25Z

I'm not so concerned with shorter (the difference is a single character), but readability matters. Any counterpoints to using n_...?

richardjgowers · 2015-08-06T18:41:35Z

cough len cough :)

hainm · 2015-08-06T18:42:47Z

@richardjgowers
you meant len(ag.atoms) vs ag.n_atoms?

ag.n_atoms looks nicer and (like @orbeckst said), we can use <TAB> with n_atoms.

@dotsdl
I meant numberOfAtoms vs n_atoms (from @richardjgowers example)

dotsdl · 2015-08-06T18:45:33Z

@hainm no, scroll up.

dotsdl · 2015-08-06T18:48:51Z

@richardjgowers I agree with @orbeckst that we should keep methods for these things for introspection, since they would essentially be minimal effort to maintain. Although I am likewise tempted to say burn them all and just use len(), I don't think it's too big a deal. As long as we are consistent in the end. :D

kain88-de · 2015-08-06T20:19:03Z

For beginners having methods like 'n_atoms' is nicer. But why not support both?

Personally I would prefer 'numatoms' because the underscore is hard to reach.

orbeckst · 2015-08-06T20:20:53Z

Ok, final verdict:

n_atoms
n_residues
n_segments
n_frames

(and no getting rid of them in favor of len()... you purists all know the code inside out by now but I also have to consider what it looks like to new users... or PIs who occasionally still have to do real work... ;-) )

dotsdl · 2015-08-06T20:24:29Z

It will be done. On it.

orbeckst · 2015-08-09T23:33:18Z

@dotsdl , please add entry to CHANGELOG for n_atoms and friends – API breakage changes must be documented in the CHANGELOG.

orbeckst added usability API labels Aug 4, 2015

orbeckst added this to the 0.11 milestone Aug 4, 2015

orbeckst mentioned this issue Aug 4, 2015

helper script for upgrading from 0.10 to 0.11 (ten2eleven.py) #377

Closed

dotsdl self-assigned this Aug 6, 2015

dotsdl mentioned this issue Aug 7, 2015

Change usage of numberOfAtoms to n_atoms (same for residues, segments); change usage of numframes to n_frames #387

Merged

richardjgowers closed this as completed in #387 Aug 7, 2015

orbeckst reopened this Aug 9, 2015

dotsdl added a commit that referenced this issue Aug 11, 2015

updated CHANGELOG entry for #376

96f813a

orbeckst closed this as completed Aug 11, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

numberOfAtoms to n_atoms and others #376

numberOfAtoms to n_atoms and others #376

hainm commented Aug 4, 2015

hainm commented Aug 4, 2015

richardjgowers commented Aug 4, 2015

orbeckst commented Aug 4, 2015

orbeckst commented Aug 4, 2015

orbeckst commented Aug 4, 2015

dotsdl commented Aug 6, 2015

hainm commented Aug 6, 2015

dotsdl commented Aug 6, 2015

richardjgowers commented Aug 6, 2015

hainm commented Aug 6, 2015

dotsdl commented Aug 6, 2015

dotsdl commented Aug 6, 2015

kain88-de commented Aug 6, 2015

orbeckst commented Aug 6, 2015

dotsdl commented Aug 6, 2015

orbeckst commented Aug 9, 2015

numberOfAtoms to n_atoms and others #376

numberOfAtoms to n_atoms and others #376

Comments

hainm commented Aug 4, 2015

hainm commented Aug 4, 2015

richardjgowers commented Aug 4, 2015

orbeckst commented Aug 4, 2015

orbeckst commented Aug 4, 2015

orbeckst commented Aug 4, 2015

dotsdl commented Aug 6, 2015

hainm commented Aug 6, 2015

dotsdl commented Aug 6, 2015

richardjgowers commented Aug 6, 2015

hainm commented Aug 6, 2015

dotsdl commented Aug 6, 2015

dotsdl commented Aug 6, 2015

kain88-de commented Aug 6, 2015

orbeckst commented Aug 6, 2015

dotsdl commented Aug 6, 2015

orbeckst commented Aug 9, 2015