Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

merging #5

Merged
merged 71 commits into from
Mar 30, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
71 commits
Select commit Hold shift + click to select a range
9bf87ed
ibm-wml-ce: Updates for 1.6.2
bethune-bryant Dec 6, 2019
4c7c958
Add description of CUDA hook issue and workarounds
maxpkatz Dec 6, 2019
971019c
Specify master_doc to avoid issues with older versions
maxpkatz Dec 8, 2019
bc6bc61
Merge pull request #173 from maxpkatz/cuda_hook_error
grahamlopez Dec 9, 2019
abdd56c
Merge pull request #172 from bethune-bryant/ibm-wml-ce-1.6.2
mpbelhorn Dec 9, 2019
a41eabe
ibm-wml-ce: add info on revision number.
bethune-bryant Dec 11, 2019
697b9ae
fix small build issues
Dec 12, 2019
878dd72
Merge pull request #182 from bethune-bryant/wml-ce-add-version
grahamlopez Dec 12, 2019
86ee996
use note block for version number note
Dec 12, 2019
cbe969a
Merge pull request #178 from maxpkatz/master_doc
grahamlopez Dec 12, 2019
1538b80
added training archive to docs
Dec 12, 2019
f39fe55
Merge pull request #184 from tom-papatheodore/add_training_archive
grahamlopez Dec 12, 2019
b47727c
Merge pull request #183 from grahamlopez/wml-ce-note
jack-morrison Dec 13, 2019
a83f9bf
Makes .wy-table-responsive always scrollable in situations where cont…
Dec 19, 2019
9dad669
added OLCF GPU Hackathons page to Training
Dec 19, 2019
9d79c5a
Fixes #165. Adds in-context links to IBM WML User Conf. Call
Dec 19, 2019
9c0f4e0
Typo correction
Dec 19, 2019
36e10eb
Merge pull request #188 from tom-papatheodore/add-gpu-hackathon-page
jack-morrison Dec 19, 2019
f674c62
ibm-wml-ce: update horovod install instructions
bethune-bryant Jan 10, 2020
8eaf637
Merge pull request #189 from jack-morrison/wml-links
jack-morrison Jan 13, 2020
ddc07f4
Merge pull request #191 from bethune-bryant/ibmwmlce_fix_horovod
jack-morrison Jan 13, 2020
3da0541
Fixes #187
Jan 23, 2020
359a296
Fixes #185. Removes 'aprun to jsrun' section of Summit User Guide.
Jan 23, 2020
c1dc7d7
Merge pull request #192 from jack-morrison/issue-187
jack-morrison Jan 23, 2020
d1bfd09
Note Spectrum MPI + Nsight Compute incompatibility
maxpkatz Jan 29, 2020
e96b3b5
Merge pull request #186 from jack-morrison/responsive-tables
grahamlopez Jan 30, 2020
f39d1a2
Merge pull request #193 from jack-morrison/issue-185
grahamlopez Jan 30, 2020
c9c0b61
Note new last updated date
maxpkatz Feb 1, 2020
bb59a53
Merge pull request #194 from maxpkatz/nsight_compute_bug_doc
grahamlopez Feb 3, 2020
b4fbd35
Fixes #195.
Feb 10, 2020
6c16294
Merge pull request #197 from jack-morrison/issue-195
jack-morrison Feb 10, 2020
8759165
fixes 198
Feb 10, 2020
af2cdf4
Merge pull request #199 from tyoung31/issue-198
jack-morrison Feb 11, 2020
dca6ab5
Fixes #203. Adds ORNL Favicon.
Feb 19, 2020
22d25d2
Merge pull request #206 from jack-morrison/issue-203
jack-morrison Feb 19, 2020
9dfcfab
Adds note re: FTP. Fixes #205.
gmarkomanolis Feb 24, 2020
d94f718
Updated HPSS paths; modified other user/project-centric paths for con…
warenaud Feb 24, 2020
92f5cb9
Adds Job Step Viewer to jsrun docs (#200)
tyoung31 Feb 24, 2020
20b74f9
TAU OpenMP offload (#202)
gmarkomanolis Feb 24, 2020
c198e33
Adds OLCF logo to user docs.
Feb 25, 2020
3188068
Merge pull request #209 from jack-morrison/issue-6
verolero86 Feb 25, 2020
c460286
Changed layout of Training Archive to table entries (#210)
tom-papatheodore Feb 26, 2020
f1e9df8
Updated the list of OLCF-sponsored GPU hackathons (#211)
tom-papatheodore Feb 27, 2020
cee2a99
updated main training landing page and removed individual sections fr…
tom-papatheodore Mar 2, 2020
1dd0317
Fixes #217
Mar 2, 2020
d0594b1
Merge pull request #218 from jack-morrison/issue-217
tom-papatheodore Mar 2, 2020
b76d17b
Adds external link icon to main nav training section
Mar 3, 2020
627d942
Adds external link icon to main nav for any external <a> element
Mar 3, 2020
2bcd026
Merge pull request #219 from jack-morrison/issue-215
tom-papatheodore Mar 3, 2020
61eb545
Adds User-Managed Software (UMS) area. Adds Flux Framework project UM…
jack-morrison Mar 3, 2020
5eb3c01
ibm-wml-ce changes for 1.7.0 (#213)
bethune-bryant Mar 6, 2020
402a573
ibm-wml-ce: Remove obsolete horovod build instructions and update ver…
bethune-bryant Mar 6, 2020
262b21f
Fixes #220. Updates project/user graphic. (#221)
jack-morrison Mar 9, 2020
25748d7
Adds external link to software news on olcf main site. Will migrate c…
jack-morrison Mar 10, 2020
e6fe34d
Resolves #225.
Mar 10, 2020
2650ba5
Merge pull request #226 from jack-morrison/issue-225
isubsmith Mar 10, 2020
3d545c5
Change new PGI version to 19.9
jack-morrison Mar 11, 2020
0dce9be
Merge pull request #227 from olcf/jack-morrison-pgi-version-fix
verolero86 Mar 11, 2020
879bbf8
Fix CUDA version
verolero86 Mar 11, 2020
10c1d60
Merge pull request #229 from olcf/verolero86-cudaversiontypo
verolero86 Mar 11, 2020
e2fe841
New versions not Default (#228)
verolero86 Mar 12, 2020
52e44f4
Adds note to homepage that UAC is still open for support.
Mar 18, 2020
0e2fbce
Merge pull request #230 from jack-morrison/uac-open-notice
verolero86 Mar 18, 2020
b5dbc1f
Add segfault on login node to known issues (#231)
verolero86 Mar 20, 2020
27c7916
Bring training archive up-to-date as of 2020-03-20 (#232)
tom-papatheodore Mar 20, 2020
8f6925b
adds 1QCY20 concalls to training archive (#233)
warenaud Mar 27, 2020
ae2c32c
adds info on user home dir snapshot backups
Mar 27, 2020
6904da5
adds info on project home dir snapshot backups
Mar 27, 2020
8d6ef9c
typo correction
Mar 27, 2020
8d96598
Adds user home and proj home paths for clarity
Mar 27, 2020
5a997de
Merge pull request #234 from jack-morrison/jackm-issue153
isubsmith Mar 27, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
30 changes: 25 additions & 5 deletions _static/css/theme_overrides.css
Original file line number Diff line number Diff line change
@@ -1,11 +1,31 @@
.wy-side-nav-search, .wy-nav-top {
background: #007833;
}

.wy-nav-content {
max-width: 1200px;
}

/* Adds whitespace between OLCF logo and Docs Home link */
body > div > nav > div > div.wy-side-nav-search > a > img {
padding-bottom: 10px;
}

/* Clicking on the OLCF logo does nothing
(disable RTD theme's default <a> behavior) */
body > div > nav > div > div.wy-side-nav-search > a{
pointer-events: none;
cursor: default;
}

/* Supersede the above block, and allow the Docs Home link to be clickable */
body > div > nav > div > div.wy-side-nav-search > a > a{
pointer-events: auto !important;
cursor: pointer !important;
color: grey !important;
}

/* Don't let the color of the Docs Home link change. */
body > div > nav > div > div.wy-side-nav-search > a > a:visited{
color: grey !important;
}

/* override table width restrictions */
@media screen and (min-width: 767px) {

Expand All @@ -16,6 +36,6 @@
}

.wy-table-responsive {
overflow: visible !important;
overflow: auto !important;
}
}
29 changes: 29 additions & 0 deletions _static/js/custom.js
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
$( document ).ready(function() {

// Create link and text for navigation back to the OLCF home page
var olcf_link = document.createElement("a");
var olcf_text = document.createTextNode("OLCF Home Page");
Expand All @@ -17,4 +18,32 @@ $( document ).ready(function() {
aside.appendChild(separator);
aside.appendChild(olcf_link);

// Insert Project Name "OLCF User Documentation" below html_logo in sidebar navigation
var project_name_link = document.createElement("a");
var project_name_text = document.createTextNode(" OLCF User Documentation");
project_name_link.appendChild(project_name_text);
project_name_link.setAttribute("href", "https://docs.olcf.ornl.gov");
project_name_link.classList.add("icon");
project_name_link.classList.add("icon-home");
wysidenavsearch = document.querySelector("body > div > nav > div > div.wy-side-nav-search > a");
wysidenavsearch.appendChild(project_name_link);


// For any external links in the main navigation, append the FontAwesome external link icon.
function iconize_external_links(nav_level){
a_elements = nav_level.getElementsByTagName("A");
for (var i = 0; i < a_elements.length; ++i){
if (a_elements[i].getAttribute("href").includes("http")){
var icon = document.createElement("i");
icon.classList.add("fa");
icon.classList.add("fa-external-link");
var spacer = document.createTextNode(" ");
a_elements[i].appendChild(spacer);
a_elements[i].appendChild(icon);
}
}
}

iconize_external_links(document.querySelector("body > div > nav > div > div.wy-menu.wy-menu-vertical"))

});
255 changes: 157 additions & 98 deletions accounts/olcf_policy_guide.rst

Large diffs are not rendered by default.

10 changes: 10 additions & 0 deletions conf.py
Original file line number Diff line number Diff line change
Expand Up @@ -46,6 +46,10 @@
# This pattern also affects html_static_path and html_extra_path.
exclude_patterns = ['_build', 'Thumbs.db', '.DS_Store']

# To avoid errors with older versions of Sphinx and Sphinx RTD theme, explicitly
# specify the master document.
master_doc = 'index'


# -- Options for HTML output -------------------------------------------------

Expand All @@ -70,6 +74,10 @@
'js/custom.js',
]

html_logo = 'images/olcf_logo.png'

html_favicon = 'images/favicon.ico'

html_context = {
'vcs_pageview_mode': 'edit',
'display_github': True,
Expand All @@ -86,6 +94,8 @@
'sticky_navigation': True,
'navigation_depth': 4,
'style_external_links': True,
'style_nav_header_background': '#efefef',
'logo_only': True,
}


Expand Down
3 changes: 2 additions & 1 deletion contributing/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,8 @@ Setup authoring environment
$ git clone https://github.com/<your-github-id>/olcf-user-docs.git

#. Point your master branch to track upstream::


$ cd olcf-user-docs
$ git remote add olcf https://github.com/olcf/olcf-user-docs.git
$ git fetch olcf
$ git branch --set-upstream-to=olcf/master
Expand Down
28 changes: 12 additions & 16 deletions data/archiving.rst
Original file line number Diff line number Diff line change
Expand Up @@ -101,17 +101,11 @@ HPSS, you might use:
and ``hsi mput`` can be used to retrieve multiple files. To send a file to HPSS,
you might use:

``hsi put a.out``

To put a file in a pre-existing directory on hpss:


``hsi “cd MyHpssDir; put a.out”``
``hsi put a.out : /hpss/prod/[projid]/users/[userid]/a.out``

To retrieve one, you might use:


``hsi get /proj/projectid/a.out``
``hsi get /hpss/prod/[projid]/proj-shared/a.out``

Here is a list of commonly used hsi commands.

Expand Down Expand Up @@ -159,32 +153,32 @@ As with the standard Unix ``tar`` utility the ``-c``, ``-x``, and ``-t``
options, respectively, function to create, extract, and list tar archive files.
The ``-K`` option verifies an existing tarfile in HPSS and the ``-X`` option can
be used to re-create the index file for an existing archive. For example, to
store all files in the directory ``dir1`` to a file named ``allfiles.tar`` on
HPSS, use the command:
store all files in the directory ``dir1`` to a file named
``/hpss/prod/[projid]/users/[userid]/allfiles.tar`` on HPSS, use the command:

``htar -cvf allfiles.tar dir1/*``
``htar -cvf /hpss/prod/[projid]/users/[userid]/allfiles.tar dir1/*``

To retrieve these files:

``htar -xvf allfiles.tar``
``htar -xvf /hpss/prod/[projid]/users/[userid]/allfiles.tar``

``htar`` will overwrite files of the same name in the target directory. **When
possible, extract only the files you need from large archives.** To display the
names of the files in the ``project1.tar`` archive file within the HPSS home
directory:

``htar -vtf project1.tar``
``htar -vtf /hpss/prod/[projid]/users/[userid]/project1.tar``

To extract only one file, ``executable.out``, from the ``project1`` directory in
the Archive file called ``project1.tar``:
the Archive file called `` /hpss/prod/[projid]/users/[userid]/project1.tar``:

``htar -xm -f project1.tar project1/ executable.out``

To extract all files from the ``project1/src`` directory in the archive file
called ``project1.tar``, and use the time of extraction as the modification
time, use the following command:

``htar -xm -f project1.tar project1/src``
``htar -xm -f /hpss/prod/[projid]/users/[userid]/project1.tar project1/src``

HTAR Limitations
-----------------
Expand Down Expand Up @@ -217,7 +211,9 @@ Maximum Number of Files per Archive 1 million
For example, when attempting to HTAR a directory with one member file larger
that 64GB, the following error message will appear:

``htar -cvf hpss_test.tar hpss_test/``
.. code::

$ htar -cvf /hpss/prod/[projid]/users/[userid]/hpss_test.tar hpss_test/

INFO: File too large for htar to handle: hpss_test/75GB.dat (75161927680 bytes)
ERROR: 1 oversize member files found - please correct and retry
Expand Down
82 changes: 56 additions & 26 deletions data/policies.rst
Original file line number Diff line number Diff line change
Expand Up @@ -9,32 +9,62 @@ Policy
A brief description of each area and basic guidelines to follow are provided in
the table below:

+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *Name* | Path | Type | Permissions | Backups | Purged | Quota | Mounted on Compute nodes |
+===================+=============================================+===========================+=============+=========+=========+=======+==========================+
| *User Home* | ``$HOME`` | NFS | User Set | yes | no | 50GB | Read-only |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *Project Home* | ``/ccs/proj/[projid]`` | NFS | 770 | yes | no | 50GB | Read-only |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *User Archive* | ``/home/$USER`` | HPSS | User Set | no | no | 2TB | No |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *Project Archive* | ``/proj/[projid]`` | HPSS | 770 | no | no | 100TB | No |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *Member Work* | ``/gpfs/alpine/scratch/[userid]/[projid]/`` | Spectrum Scale (ex. GPFS) | 700 | no | 90 days | 50TB | Yes |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *Project Work* | ``/gpfs/alpine/proj-shared/[projid]`` | Spectrum Scale (ex. GPFS) | 770 | no | 90 days | 50TB | Yes |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+
| *World Work* | ``/gpfs/alpine/world-shared/[projid]`` | Spectrum Scale (ex. GPFS) | 775 | no | 90 days | 50TB | Yes |
+-------------------+---------------------------------------------+---------------------------+-------------+---------+---------+-------+--------------------------+


On Summit paths to the various project-centric work storage areas are simplified
by the use of environment variables that point to the proper directory on a
per-user basis:

- Member Work Directory: ``$MEMBERWORK/[projid]``
- Project Work Directory: ``$PROJWORK/[projid]``
- World Work Directory: ``$WORLDWORK/[projid]``
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| Area | Path | Type | Permissions | Quota | Backups | Purged | Retention | On Compute Nodes |
+=====================+=============================================+================+=============+========+=========+=========+============+==================+
| User Home | ``/ccs/home/[userid]`` | NFS | User set | 50 GB | Yes | No | 90 days | Read-only |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| User Archive [#f1]_ | ``/home/[userid]`` | HPSS | User set | 2TB | No | No | 90 days | No |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| User Archive [#f2]_ | ``/home/[userid]`` | HPSS | 700 | N/A | N/A | N/A | N/A | No |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| Project Home | ``/ccs/proj/[projid]`` | NFS | 770 | 50 GB | Yes | No | 90 days | Read-only |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| Member Work | ``/gpfs/alpine/[projid]/scratch/[userid]`` | Spectrum Scale | 700 [#f3]_ | 50 TB | No | 90 days | N/A [#f4]_ | Yes |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| Project Work | ``/gpfs/alpine/[projid]/proj-shared`` | Spectrum Scale | 770 | 50 TB | No | 90 days | N/A [#f4]_ | Yes |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| World Work | ``/gpfs/alpine/[projid]/world-shared`` | Spectrum Scale | 775 | 50 TB | No | 90 days | N/A [#f4]_ | Yes |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| Member Archive | ``/hpss/prod/[projid]/users/$USER`` | HPSS | 700 | 100 TB | No | No | 90 days | No |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| Project Archive | ``/hpss/prod/[projid]/proj-shared`` | HPSS | 770 | 100 TB | No | No | 90 days | No |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+
| World Archive | ``/hpss/prod/[projid]/world-shared`` | HPSS | 775 | 100 TB | No | No | 90 days | No |
+---------------------+---------------------------------------------+----------------+-------------+--------+---------+---------+------------+------------------+

| *Area -* The general name of storage area.
| *Path -* The path (symlink) to the storage area's directory.
| *Type -* The underlying software technology supporting the storage area.
| *Permissions -* UNIX Permissions enforced on the storage area's top-level directory.
| *Quota -* The limits placed on total number of bytes and/or files in the storage area.
| *Backups -* States if the data is automatically duplicated for disaster recovery purposes.
| *Purged -* Period of time, post-file-access, after which a file will be marked as eligible for permanent deletion.
| *Retention -* Period of time, post-account-deactivation or post-project-end, after which data will be marked as eligible for permanent deletion.
| *On Compute Nodes -* Is this filesystem available on compute nodes (yes, no, or available but read-only)

**Important!** Files within "Work" directories (i.e., Member Work,
Project Work, World Work) are *not* backed up and are *purged* on a
regular basis according to the timeframes listed above.

.. rubric:: Footnotes

.. [#f1] This entry is for legacy User Archive directories which contained user data on January 14, 2020.

.. [#f2] User Archive directories that were created (or had no user data) after January 14, 2020. Settings other than permissions are not applicable because directories are root-owned and contain no user files.

.. [#f3] Permissions on Member Work directories can be controlled to an extent by project members. By default, only the project member has any accesses, but accesses can be granted to other project members by setting group permissions accordingly on the Member Work directory. The parent directory of the Member Work directory prevents accesses by "UNIX-others" and cannot be changed (security measures).

.. [#f4] Retention is not applicable as files will follow purge cycle.



On Summit, Rhea and the DTNs, additional paths to the various project-centric work areas are available
via the following symbolic links and/or environment variables:

- Member Work Directory: ``/gpfs/alpine/scratch/[userid]/[projid]`` or ``$MEMBERWORK/[projid]``
- Project Work Directory: ``/gpfs/alpine/proj-shared/[projid]`` or ``$PROJWORK/[projid]``
- World Work Directory: ``/gpfs/alpine/world-shared/[projid]`` or ``$WORLDWORK/[projid]``

Information
============
Expand Down
Loading