Photutils tutorial notebook #2: source detection #3

laurenmarietta · 2018-09-19T15:12:48Z

The second notebook tutorial for photutils, demonstrating methods for source detection.

Some preliminary concerns I have about this notebook:

I'm not sure that my rationalization for why IRAFStarFinder and DAOStarFinder have such different results (because IRAF doesn't allow for elliptical Gaussians) is correct, since I have only the most superficial understanding of how those algorithms work. I would appreciate some feedback from someone who understands them well.
Is it too long? If so, I feel as though the image segmentation notebook could potentially be a separate notebook completely.

@eteq I know you mentioned that Tom might be reviewing these notebooks now - if you know his GitHub account and want to tag/assign him, please do!

closes eteq#2)

…small changes.

# Conflicts: # .gitignore # photutils/01_photutils_background_estimation.ipynb # photutils/02_photutils_source_detection.ipynb

eteq · 2018-09-19T16:12:38Z

Thanks @laurenmarietta - Tom is @Onoddil, so definitely makes sense for him to have a look at this as a "user". I'll also plan to take a look at it from my perspective in more detail later.

On the specific concerns:

I'm not sure that my rationalization for why IRAFStarFinder and DAOStarFinder have such different results

I agree this is rather odd. I would have expected <5% difference, not ~90%! I'll try to have a look/ask around some other experts and see if we're missing something.

Is it too long?

I think the length is fine. I agree the segmentation seems a bit out-of-context, but I think it might be even worse to have a separate one for segmentation... An interesting experiment to tie it together might be to do a comparison of the segmentation and other methods? My prediction is that it'll work better on galaxies and worse for stars... but you never know!

eteq · 2018-09-19T16:56:20Z

@laurenmarietta - Ah, I think I have part of the answer. If you look at the IRAFStarFinder and DAOStarFinder you'll see their default settings for the various statistics like sharplo/sharphi and roundlo/roundhi are different. I tried setting them to the same, and then they're a lot closer (although still not as much the same as I would have thought... but within ~30%). So maybe try that as a starting point?

Two other things to just mention that I noticed on the way:

Your current FWHM is wide enough that it's actually missing stars, I think. If you zoom in on a star you'll see that it's only maybe ~3 pixels wide. Since these algorithms are optimized for stars anyway you might lower the FWHM (and the threshold) to get more stars and fewer galaxies? That might also resolve the IRAF/DAO difference since stars are more circular than galaxies anyway
You might find it useful to show cutouts of individual objects - the following worked well for me so feel free to incorporate it if you think it's useful:

#DAO
fig, axs = plt.subplots(3,3)

cutout_sz = 20

srcs = np.random.permutation(sources_dao)[:axs.size]
for ax, src in zip(axs.ravel(), srcs):
    slc = (slice(int(src['ycentroid']-cutout_sz), int(src['ycentroid']+cutout_sz)),
           slice(int(src['xcentroid']-cutout_sz), int(src['xcentroid']+cutout_sz)))
    ax.imshow(xdf_image[slc], norm=norm_image)
    ax.text(2, 2, str(src['id']), color='w', va='top')
    ax.set_xticks([])
    ax.set_yticks([])

Onoddil · 2018-09-20T14:14:17Z

So I don't think the issue can be the elliptical nature of the Gaussian as, without specifying, DAOStarFinder has ratio=1.0 as its default value, assuming circular Gaussians. If I run IRAFFinder with

iraffind = IRAFStarFinder(fwhm=5.0, threshold=20.*std, minsep_fwhm=0.0, sharplo=0.2,
sharphi=1.0, roundlo=-1.0, roundhi=1.0, sky=0.0)
sources_iraf = iraffind(xdf_image * ~xdf_image.mask)
print(sources_iraf)

I get:

id xcentroid ... flux mag
---- ------------------ ... ------------------- ---------------------
1 2509.734474225375 ... 0.15605459964717738 2.0168085653827426
... ... ... ... ...
1415 2514.8275136598045 ... 0.17862686840817332 1.870133038817817
Length = 1415 rows

bringing me into ~96% agreement with the DAOStarFinder. From a quick look at the reason seems to be minsep_fwhm, as if I remove that and run

iraffind = IRAFStarFinder(fwhm=5.0, threshold=20.*std, sharplo=0.2,
sharphi=1.0, roundlo=-1.0, roundhi=1.0, sky=0.0)
sources_iraf = iraffind(xdf_image * ~xdf_image.mask)
print(sources_iraf)

I get

id xcentroid ... flux mag
---- ------------------ ... ------------------- ---------------------
1 2509.734474225375 ... 0.15605459964717738 2.0168085653827426
... ... ... ... ...
1027 2514.8275136598045 ... 0.17862686840817332 1.870133038817817
Length = 1027 rows

which seems to agree with @eteq on the 30% difference. I therefore think the issue the default requirement that objects have to be ~13 pixels separated, which is just not going to be the case in the Hubble EXtremely Deep Field!

Re: @eteq 's 3 vs 5 pixel FWHM: if you decrease the FWHM IRAFStarFinder gains sources (likely missed stars), but DAOStarFinder loses sources (likely dropped galaxies larger than the new, smaller FWHM). However, for this test case I'm not sure that matters; we should simply go with the FWHM that gives a good selection of all sources, and perhaps briefly mention the tuning of the parameters to the science being done if that isn't already mentioned.

Just to throw a slight spanner in the self-similarity works if you run

IRAFStarFinder(fwhm=5.0, minsep_fwhm=0.0, threshold=20.*std)

(i.e., the IRAF default sharpness etc., just without the minimum distance requirement) you only get 280 sources, so at that point the large differences between default roundness and sharpness are likely dropping galaxies, as daofind allows roundness to be in the domain [-1, 1] but iraffind limits roundness to [0, 0.2] and iraffind requiring a lower sharpness of 0.5 to daofind's 0.2.

In summary, for this comment is a bit disjointed by this point: for a self-similar comparison you can set IRAFStarFinder to the same settings as DAOStarFinder, but require minsep_fwhm = 0.0, or you can justify the differences with a combination of the 13 pixel separation requirement and stricter limits on sharpness and roundness (where roundness is likely the galaxy/star divide issue), and consider a FWHM that gives the best balance between stars and galaxies for both finders.

…AOStarFinder

laurenmarietta · 2018-12-12T19:53:10Z

After what feels like a year, I have finally had the time to take another look at this notebook.

I've made just a couple changes - thanks, Erik, for that clever code to show cutouts of the sources. And thanks to both of you for your sleuthing regarding the differences between IRAFStarFinder and DAOStarFinder! I added a section in the notebook to address that specifically, which I am hopeful seems instructive and not just nit-picky.

If either of you spot anything else that could use clarification or improvement, please let me know.

…so angry

Onoddil · 2018-12-29T20:57:28Z

Managed to have a read of this notebook, but haven't got iPython to play nicely so I've not managed to verify the plot outputs (I'll have another look at these notebooks once I'm back in the US). The flow and words etc. look good though; I really just have one minor comment.

I'm not sure if it was deliberate or a slight formatting issue, but any of the "inline" comments (e.g., https://github.com/eteq/csi-stsci-notebooks/pull/3/files#diff-fbc70e2f8789c7a80f84a34d56d11e00R39 or any of the Exercises boxes) expose some formatting, such as asterisks for bolding, ` marks around `astropy`, URLs, etc. I assume these were supposed to format within their inline boxes, but haven't quite come out right. If it was a deliberate choice, however, then feel free to ignore this, but it looks slightly "incomplete" to me.

eteq

@laurenmarietta - Overall this looks really good! I saw several minor things inline, but actually I think this is almost there.

eteq · 2019-03-14T15:32:18Z