TJ's annotated DRAFT PR not meant to be merged. #61

trjaffe · 2025-09-26T18:12:05Z

HEASARC demonstration and tutorial notebook review template

Requesting a review

Please request a review (through the GitHub pull request interface) from one of the HEASARC team members associated with the HEASARC-tutorials repository, being mindful of observer's workloads.

Available for all reviews:

David Turner
Jordan Eagle

Observers/available for some reviews:

Tess Jaffe
Abdu Zoghbi

Reviewers should attempt to provide initial comments within 1-2 days.

Please feel free to tag any user who you feel would like to discuss the notebook under review.

Critical review criteria

The author of the pull request should make an effort to go through these check points and ensure that their submission satisfies each point - reviewers will also compare to these checklists.

Science review checklist

Is there a use case in the introduction which motivates the code? will our community understand this motivation/code?
Does the code do what the intro says it is going to do?
Is it scientifically accurate?
Does it include work linked to a buzzword (e.g. big data, spectroscopy, time domain, forced photometry, cloud)?
Have all necessary references to literature been included?

Tech review checklist

Documentation:
- Is every function documented?
- Does it follow the style guide? https://github.com/spacetelescope/style-guides/blob/master/guides/jupyter-notebooks.md
- Do all code cells have corresponding narratives/comments?
- If designed for Fornax, include information about which server type and environment to choose when logging in to Fornax and the notebook's expected runtime given that setup (e.g. "As of 2024 August, this notebook takes about 3 minutes to run to completion on Fornax using Server Type: 'Standard - 8GB RAM/4 CPU' and Environment: 'Default Astrophysics' (image).")
Dependencies and imports:
- Does the notebook have a corresponding requirements_<notebook_filename>.txt file listing all its direct dependencies?
- Are all dependencies listed in the requirements file in fact required? Please revisit the list as the notebook evolves.
- Is the requirements file used in a commented-out cell in the notebook with # !pip install -r <filename>; and has the notebook no other installation related cells?
- Are dependencies kept to a minimum? E.g. no new library introduced for minimal usage while another library that is already a dependency can do the same functionality? (e.g. do not introduce pandas to print out a line from a FITS table for which we already need to use astropy for IO; add dependencies when their functionality is unique or required for efficiency, etc.)
Notebook execution, error handling, etc.:
- Does the notebook run end-to-end, out of the box?
- Are errors handled appropriately, with try/except statements that are narrow in scope?
- Have warnings been dealt with appropriately, preferably by updating the code to avoid them (i.e., not by simply silencing them)?
Efficiency:
- Is data accessed from the cloud where possible?
- Is the code parallelized where possible?
- If the notebook is intended to be scaled up, does it do that efficiently?
- Is memory usage optimized where possible?
Cleanup:
- Have blocks of code that need to be re-used been turned into functions (rather than being duplicated)?
- Have unused libraries been removed from the requirements.txt file and the import statements?
- Has unused code been removed (e.g., unused functions and commented-out lines)?
- Are comment lines wrapped so all fit within a max of 90 - 100 characters per line?
- Are code lines reasonably short where possible? some code lines can't easily be wrapped and that is ok
- Do plots use color-blind friendly palettes for plotting? try this simulator for visual check

trjaffe · 2025-09-26T18:21:01Z

data_access/data_access.md

-The best way to grab S3 cloud URI data from MAST is using <a href="https://astroquery.readthedocs.io/en/latest/mast/mast_obsquery.html#downloading-data-products">astroquery</a>. 
+Let's grab S3 cloud URI data from MAST using <a href="https://astroquery.readthedocs.io/en/latest/mast/mast_obsquery.html#downloading-data-products">astroquery</a>. This module has the *optional* ability to return the location of the data from AWS.
+
+*Look up which archives' astroquery modules do have this.*


Er, apparently none except us? And all three of ours look different.

Yes, that is correct and unfortunate. IRSA has a simple PyVO column 'cloud_access'. I like this one the best. It is direct and easy. MAST uses a astroquery function get_cloud_uris which is fine. IMO HEASARC has the most difficult approach - seemingly hiding it in a datalink table you must read and then find the appropriate line for any URL you may want.

trjaffe · 2025-09-26T18:22:20Z

data_access/data_access.md

 ```python
-url = filtered_table[8]['access_url']
+%skip
+#  WHY DOESNT THIS WORK?


Xamin issue I've just raised to higher priority.

It works for me? On my local that is, which does not have the same environment set up as Fornax (e.g., I run in Python 3.9.16).

trjaffe · 2025-09-26T18:23:55Z

data_access/data_access.md

-s3_file = "s3://nasa-irsa-wise/wise/allwise/images/p3am_cdd/08/0830/0830p227_ac51/0830p227_ac51-w2-int-3.fits"
+#s3_uri follows s3://<bucket-name>/<key> structure where the key is often 
+#   but not always <path>/<filename>
+bucket = json.loads(allwise_table['cloud_access'][0])['aws']['bucket_name']


This bit is the one I really think we ought to sue.

TJ's annotated and modified

bf827c6

trjaffe commented Sep 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TJ's annotated DRAFT PR not meant to be merged. #61

TJ's annotated DRAFT PR not meant to be merged. #61

Uh oh!

trjaffe commented Sep 26, 2025

Uh oh!

trjaffe Sep 26, 2025

Uh oh!

jleagle94 Sep 30, 2025

Uh oh!

trjaffe Sep 26, 2025

Uh oh!

jleagle94 Sep 30, 2025

Uh oh!

trjaffe Sep 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

TJ's annotated DRAFT PR not meant to be merged. #61

Are you sure you want to change the base?

TJ's annotated DRAFT PR not meant to be merged. #61

Uh oh!

Conversation

trjaffe commented Sep 26, 2025

HEASARC demonstration and tutorial notebook review template

Requesting a review

Critical review criteria

Science review checklist

Tech review checklist

Uh oh!

trjaffe Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

jleagle94 Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

trjaffe Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

jleagle94 Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

trjaffe Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants