Doc changes

This commit is contained in:
2021-05-21 12:34:19 +02:00
parent 42d8f38934
commit fcfdbf1344
46 changed files with 447 additions and 528 deletions

---
title: Archive & PSI Data Catalog
#tags:
keywords: Linux, archive, DataCatalog,
last_updated: 31 January 2020
summary: "This document describes how to use the PSI Data Catalog for archiving Merlin6 data."
sidebar: merlin6_sidebar
permalink: /merlin6/archive.html
---
## PSI Data Catalog as a PSI Central Service
PSI provides access to the ***Data Catalog*** for **long-term data storage and retrieval**. Data is
stored on the ***PetaByte Archive*** at the **Swiss National Supercomputing Centre (CSCS)**.
The Data Catalog and Archive are suitable for:
* Raw data generated by PSI instruments
* Derived data produced by processing some inputs
* Data required to reproduce PSI research and publications
The Data Catalog is part of PSI's effort to conform to the FAIR principles for data management.
In accordance with this policy, ***data will be publicly released under CC-BY-SA 4.0 after an
embargo period expires.***
The Merlin cluster is connected to the Data Catalog, so users can archive data stored in the
Merlin storage under the ``/data`` directories (currently, ``/data/user`` and ``/data/project``).
Archiving from other directories is also possible; however, the process is much slower, as the data
cannot be directly retrieved by the PSI archive central servers (**central mode**) and needs to
be copied to them indirectly (**decentral mode**).
Archiving can be done from any node accessible by the users (usually from the login nodes).
{{site.data.alerts.tip}} Archiving can be done in two different ways:
<br>
<b>'Central mode':</b> Possible for the user and project data directories. This is the
fastest way, as it does not require a remote copy (data is directly retrieved by the central AIT servers from Merlin
through 'merlin-archive.psi.ch').
<br>
<br>
<b>'Decentral mode':</b> Possible for any directory. This is the slowest way of archiving, as it requires
copying ('rsync') the data from Merlin to the central AIT servers.
{{site.data.alerts.end}}
## Procedure
### Overview
Below are the main steps for using the Data Catalog.
* Ingest the dataset into the Data Catalog. This makes the data known to the Data Catalog system at PSI:
* Prepare a metadata file describing the dataset
* Run **``datasetIngestor``** script
* If necessary, the script will copy the data to the PSI archive servers
* Usually this is necessary when archiving from directories other than **``/data/user``** or
**``/data/project``**. It would also be necessary if the Merlin export server (**``merlin-archive.psi.ch``**)
is down for any reason.
* Archive the dataset:
* Visit [https://discovery.psi.ch](https://discovery.psi.ch)
* Click **``Archive``** for the dataset
* The system will now copy the data to the PetaByte Archive at CSCS
* Retrieve data from the catalog:
* Find the dataset on [https://discovery.psi.ch](https://discovery.psi.ch) and click **``Retrieve``**
* Wait for the data to be copied to the PSI retrieval system
* Run **``datasetRetriever``** script
Since large data sets may take a lot of time to transfer, some steps are designed to happen in the
background. The discovery website can be used to track the progress of each step.
### Account Registration
Two types of account permit access to the Data Catalog. If your data was collected at a ***beamline***, you may
have been assigned a **``p-group``** (e.g. ``p12345``) for the experiment. Other users are assigned an **``a-group``**
(e.g. ``a-12345``).
Groups are usually assigned to a PI, and individual user accounts are then added to the group. This is done
upon user request through PSI Service Now. For existing **a-groups** and **p-groups**, you can follow the standard
central procedures. Alternatively, if you do not know how to do that, follow the Merlin6
**[Requesting extra Unix groups](/merlin6/request-account.html#requesting-extra-unix-groups)** procedure, or open
a **[PSI Service Now](https://psi.service-now.com/psisp)** ticket.
### Installation
Accessing the Data Catalog is done through the [SciCat software](https://melanie.gitpages.psi.ch/SciCatPages/).
Documentation is here: [ingestManual.pdf](https://melanie.gitpages.psi.ch/SciCatPages/ingestManual.pdf).
#### (Merlin systems) Loading datacatalog tools
This is the ***officially supported method*** for archiving from the Merlin cluster.
The latest datacatalog software is maintained in the PSI module system. To access it from the Merlin systems, run the following command:
```bash
module load datacatalog
```
This can be done from any host in the Merlin cluster accessible to users; usually, the login nodes are used for archiving.
#### (Non-standard systems) Installing datacatalog tools
***This method is not supported by the Merlin admins***. However, we provide a small recipe for archiving from any host at PSI.
For any problems, please contact Central AIT.
If you do not have access to PSI modules (for instance, when archiving from Ubuntu systems), then you can install the
datacatalog software yourself. These tools require 64-bit Linux. To ingest from Windows systems, it is suggested to
transfer the data to a Linux system such as Merlin.
We suggest storing the SciCat scripts in ``~/bin`` so that they can be easily accessed.
```bash
mkdir -p ~/bin
cd ~/bin
/usr/bin/curl -O https://intranet.psi.ch/pub/Daas/WebHome/datasetIngestor
chmod +x ./datasetIngestor
/usr/bin/curl -O https://intranet.psi.ch/pub/Daas/WebHome/datasetRetriever
chmod +x ./datasetRetriever
```
When the scripts are updated you will be prompted to re-run some of the above commands to get the latest version.
You can call the ingestion scripts using the full path (``~/bin/datasetIngestor``) or else add ``~/bin`` to your unix PATH.
To do so, add the following line to your ``~/.bashrc`` file:
```bash
export PATH="$HOME/bin:$PATH"
```
### Ingestion
The first step to ingesting your data into the catalog is to prepare a file describing what data you have. This is called
**``metadata.json``**, and can be created with a text editor (e.g. *``vim``*). It can in principle be saved anywhere,
but keeping it with your archived data is recommended. For more information about the format, see the 'Bio metadata'
section below. An example follows:
```json
{
"principalInvestigator": "albrecht.gessler@psi.ch",
"creationLocation": "/PSI/EMF/JEOL2200FS",
"dataFormat": "TIFF+LZW Image Stack",
"sourceFolder": "/gpfs/group/LBR/pXXX/myimages",
"owner": "Wilhelm Tell",
"ownerEmail": "wilhelm.tell@psi.ch",
"type": "raw",
"description": "EM micrographs of amygdalin",
"ownerGroup": "a-12345",
"scientificMetadata": {
"description": "EM micrographs of amygdalin",
"sample": {
"name": "Amygdalin beta-glucosidase 1",
"uniprot": "P29259",
"species": "Apple"
},
"dataCollection": {
"date": "2018-08-01"
},
"microscopeParameters": {
"pixel size": {
"v": 0.885,
"u": "A"
},
"voltage": {
"v": 200,
"u": "kV"
},
"dosePerFrame": {
"v": 1.277,
"u": "e/A2"
}
}
}
}
```
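Before ingesting, it can be worth checking that the file is syntactically valid JSON. A minimal sketch using `python3` (an assumption; any JSON validator, e.g. `jq`, works equally well):

```shell
# Check metadata.json for JSON syntax errors before running the ingestor.
# python3 is assumed to be available on the system.
python3 -m json.tool metadata.json > /dev/null \
  && echo "metadata.json: valid JSON" \
  || echo "metadata.json: syntax error"
```

The `datasetIngestor` dry run performs its own metadata checks; this only catches plain syntax mistakes early.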
The following steps can be run from wherever you saved your ``metadata.json``. First, perform a "dry-run" which will check the metadata for errors:
```bash
datasetIngestor metadata.json
```
It will ask for your PSI credentials and then print some info about the data to be ingested. If there are no errors, proceed to the real ingestion:
```bash
datasetIngestor --ingest --autoarchive metadata.json
```
For particularly important datasets, you may also want to use the parameter **``--tapecopies 2``** to store **redundant copies** of the data.
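Combining the options shown above, such an ingestion command would look like this (a sketch built only from the flags documented above):

```bash
datasetIngestor --ingest --autoarchive --tapecopies 2 metadata.json
```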
You will be asked whether you want to copy the data to the central system:
* If you are on the Merlin cluster and you are archiving data from ``/data/user`` or ``/data/project``, answer 'no' since the data catalog can
directly read the data.
* If you are in a directory other than ``/data/user`` and ``/data/project``, or you are on a desktop computer, answer 'yes'. Copying large datasets
to the PSI archive system may take quite a while (minutes to hours).
If there are no errors, your data has been accepted into the data catalog! From now on, no changes should be made to the ingested data.
This is important, since the next step is for the system to copy all the data to the CSCS Petabyte archive. Writing to tape is slow, so
this process may take several days, and it will fail if any modifications are detected.
If using the ``--autoarchive`` option as suggested above, your dataset should now be in the queue. Check the data catalog:
[https://discovery.psi.ch](https://discovery.psi.ch). Your job should have status 'WorkInProgress'. You will receive an email when the ingestion
is complete.
If you didn't use ``--autoarchive``, you need to manually move the dataset into the archive queue. From **discovery.psi.ch**, navigate to the 'Archive'
tab. You should see the newly ingested dataset. Check the dataset and click **``Archive``**. You should see the status change from **``datasetCreated``** to
**``scheduleArchiveJob``**. This indicates that the data is in the process of being transferred to CSCS.
After a few days the dataset's status will change to **``datasetOnArchive``**, indicating the data is stored. At this point it is safe to delete the original data.
#### Useful commands
Running the datasetIngestor in dry mode (**without** ``--ingest``) finds most errors. However, it is sometimes convenient to find potential errors
yourself with simple unix commands.
Find problematic filenames
```bash
find . -iregex '.*/[^/]*[^a-zA-Z0-9_ ./-][^/]*'
```
Find broken links
```bash
find -L . -type l
```
Find outside links
```bash
find . -type l -exec bash -c 'realpath --relative-base "`pwd`" "$0" 2>/dev/null |egrep "^[./]" |sed "s|^|$0 ->|" ' '{}' ';'
```
Delete certain files (use with caution)
```bash
# Empty directories
find . -type d -empty -delete
# Backup files
find . -name '*~' -delete
find . -name '*#autosave#' -delete
```
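Since ``-delete`` is irreversible, a cautious sketch is to preview the matches first with ``-print`` (same patterns as above):

```shell
# Preview what the cleanup commands above would remove, without deleting
find . -type d -empty -print
find . -name '*~' -print
find . -name '*#autosave#' -print
```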
### Troubleshooting & Known Bugs
* The following message can be safely ignored:
```bash
key_cert_check_authority: invalid certificate
Certificate invalid: name is not a listed principal
```
It indicates that no Kerberos token was provided for authentication. You can avoid the warning by first running ``kinit`` (on PSI Linux systems).
* For decentral ingestion cases, the copy step is indicated by a message ``Running [/usr/bin/rsync -e ssh -avxz ...``. It is expected that this
step will take a long time and may appear to have hung. You can check which files have been successfully transferred using rsync:
```bash
rsync --list-only user_n@pb-archive.psi.ch:archive/UID/PATH/
```
where UID is the dataset ID (12345678-1234-1234-1234-123456789012) and PATH is the absolute path to your data. Note that rsync creates directories first and that the transfer order is not alphabetical in some cases, but it should be possible to see whether any data has transferred.
* There is currently a limit on the number of files per dataset (technically, the limit is on the total length of all file paths). It is recommended to break up datasets into 300'000 files or fewer.
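A quick way to check a directory against this recommendation before ingesting (a sketch; the 300'000 threshold is the recommendation above):

```shell
# Count regular files under the current directory and compare against the
# recommended per-dataset maximum.
LIMIT=300000
COUNT=$(find . -type f | wc -l)
if [ "$COUNT" -gt "$LIMIT" ]; then
  echo "$COUNT files: consider splitting this dataset"
else
  echo "$COUNT files: within the recommended limit"
fi
```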
<details>
<summary>[Show Example]: Sample ingestion output (datasetIngestor 1.1.11)</summary>
<pre class="terminal code highlight js-syntax-highlight plaintext" lang="plaintext" markdown="false">
/data/project/bio/myproject/archive $ datasetIngestor -copy -autoarchive -allowexistingsource -ingest metadata.json
2019/11/06 11:04:43 Latest version: 1.1.11
2019/11/06 11:04:43 Your version of this program is up-to-date
2019/11/06 11:04:43 You are about to add a dataset to the === production === data catalog environment...
2019/11/06 11:04:43 Your username:
user_n
2019/11/06 11:04:48 Your password:
2019/11/06 11:04:52 User authenticated: XXX
2019/11/06 11:04:52 User is member in following a or p groups: XXX
2019/11/06 11:04:52 OwnerGroup information a-XXX verified successfully.
2019/11/06 11:04:52 contactEmail field added: XXX
2019/11/06 11:04:52 Scanning files in dataset /data/project/bio/myproject/archive
2019/11/06 11:04:52 No explicit filelistingPath defined - full folder /data/project/bio/myproject/archive is used.
2019/11/06 11:04:52 Source Folder: /data/project/bio/myproject/archive at /data/project/bio/myproject/archive
2019/11/06 11:04:57 The dataset contains 100000 files with a total size of 50000000000 bytes.
2019/11/06 11:04:57 creationTime field added: 2019-07-29 18:47:08 +0200 CEST
2019/11/06 11:04:57 endTime field added: 2019-11-06 10:52:17.256033 +0100 CET
2019/11/06 11:04:57 license field added: CC BY-SA 4.0
2019/11/06 11:04:57 isPublished field added: false
2019/11/06 11:04:57 classification field added: IN=medium,AV=low,CO=low
2019/11/06 11:04:57 Updated metadata object:
{
"accessGroups": [
"XXX"
],
"classification": "IN=medium,AV=low,CO=low",
"contactEmail": "XXX",
"creationLocation": "XXX",
"creationTime": "2019-07-29T18:47:08+02:00",
"dataFormat": "XXX",
"description": "XXX",
"endTime": "2019-11-06T10:52:17.256033+01:00",
"isPublished": false,
"license": "CC BY-SA 4.0",
"owner": "XXX",
"ownerEmail": "XXX",
"ownerGroup": "a-XXX",
"principalInvestigator": "XXX",
"scientificMetadata": {
...
},
"sourceFolder": "/data/project/bio/myproject/archive",
"type": "raw"
}
2019/11/06 11:04:57 Running [/usr/bin/ssh -l user_n pb-archive.psi.ch test -d /data/project/bio/myproject/archive].
key_cert_check_authority: invalid certificate
Certificate invalid: name is not a listed principal
user_n@pb-archive.psi.ch's password:
2019/11/06 11:05:04 The source folder /data/project/bio/myproject/archive is not centrally available (decentral use case).
The data must first be copied to a rsync cache server.
2019/11/06 11:05:04 Do you want to continue (Y/n)?
Y
2019/11/06 11:05:09 Created dataset with id 12.345.67890/12345678-1234-1234-1234-123456789012
2019/11/06 11:05:09 The dataset contains 108057 files.
2019/11/06 11:05:10 Created file block 0 from file 0 to 1000 with total size of 413229990 bytes
2019/11/06 11:05:10 Created file block 1 from file 1000 to 2000 with total size of 416024000 bytes
2019/11/06 11:05:10 Created file block 2 from file 2000 to 3000 with total size of 416024000 bytes
2019/11/06 11:05:10 Created file block 3 from file 3000 to 4000 with total size of 416024000 bytes
...
2019/11/06 11:05:26 Created file block 105 from file 105000 to 106000 with total size of 416024000 bytes
2019/11/06 11:05:27 Created file block 106 from file 106000 to 107000 with total size of 416024000 bytes
2019/11/06 11:05:27 Created file block 107 from file 107000 to 108000 with total size of 850195143 bytes
2019/11/06 11:05:27 Created file block 108 from file 108000 to 108057 with total size of 151904903 bytes
2019/11/06 11:05:27 short dataset id: 0a9fe316-c9e7-4cc5-8856-e1346dd31e31
2019/11/06 11:05:27 Running [/usr/bin/rsync -e ssh -avxz /data/project/bio/myproject/archive/ user_n@pb-archive.psi.ch:archive
/0a9fe316-c9e7-4cc5-8856-e1346dd31e31/data/project/bio/myproject/archive].
key_cert_check_authority: invalid certificate
Certificate invalid: name is not a listed principal
user_n@pb-archive.psi.ch's password:
Permission denied, please try again.
user_n@pb-archive.psi.ch's password:
/usr/libexec/test_acl.sh: line 30: /tmp/tmpacl.txt: Permission denied
/usr/libexec/test_acl.sh: line 30: /tmp/tmpacl.txt: Permission denied
/usr/libexec/test_acl.sh: line 30: /tmp/tmpacl.txt: Permission denied
/usr/libexec/test_acl.sh: line 30: /tmp/tmpacl.txt: Permission denied
/usr/libexec/test_acl.sh: line 30: /tmp/tmpacl.txt: Permission denied
...
2019/11/06 12:05:08 Successfully updated {"pid":"12.345.67890/12345678-1234-1234-1234-123456789012",...}
2019/11/06 12:05:08 Submitting Archive Job for the ingested datasets.
2019/11/06 12:05:08 Job response Status: okay
2019/11/06 12:05:08 A confirmation email will be sent to XXX
12.345.67890/12345678-1234-1234-1234-123456789012
</pre>
</details>
### Retrieving data
The retrieval process is still a work in progress. For more information, see the ingest manual.
## Further Information
* **[PSI Data Catalog](https://discovery.psi.ch)**
* **[Full Documentation](https://melanie.gitpages.psi.ch/SciCatPages/)**: **[PDF](https://melanie.gitpages.psi.ch/SciCatPages/ingestManual.pdf)**.
* Data Catalog **[Official Website](https://www.psi.ch/photon-science-data-services/data-catalog-and-archive)**
* Data catalog **[SciCat Software](https://scicatproject.github.io/)**
* **[FAIR](https://www.nature.com/articles/sdata201618)** definition and **[SNF Research Policy](http://www.snf.ch/en/theSNSF/research-policies/open_research_data/Pages/default.aspx#FAIR%20Data%20Principles%20for%20Research%20Data%20Management)**
* **[Petabyte Archive at CSCS](https://www.cscs.ch/fileadmin/user_upload/contents_publications/annual_reports/AR2017_Online.pdf)**

---
title: Connecting from a Linux Client
#tags:
keywords: Linux, connecting, client, configuration, SSH, X11
last_updated: 23 Oct 2019
summary: "This document describes a recommended setup for a Linux client."
sidebar: merlin6_sidebar
permalink: /merlin6/connect-from-linux.html
---
## SSH without X11 Forwarding
This is the standard method. Official X11 support is provided through [NoMachine](/merlin6/nomachine.html).
For normal SSH sessions, use your SSH client as follows:
```bash
ssh $username@merlin-l-01.psi.ch
ssh $username@merlin-l-001.psi.ch
ssh $username@merlin-l-002.psi.ch
```
## SSH with X11 Forwarding
Official X11 Forwarding support is through NoMachine. Please follow the document
[{Job Submission -> Interactive Jobs}](/merlin6/interactive-jobs.html#Requirements) and
[{Accessing Merlin -> NoMachine}](/merlin6/nomachine.html) for more details. However,
we provide a small recipe for enabling X11 Forwarding in Linux.
* For enabling client X11 forwarding, add the following to the start of ``~/.ssh/config``
to implicitly add ``-Y`` to all ssh connections:
```bash
ForwardAgent yes
ForwardX11Trusted yes
```
* Alternatively, you can add the option ``-Y`` to the ``ssh`` command. For example:
```bash
ssh -Y $username@merlin-l-01.psi.ch
ssh -Y $username@merlin-l-001.psi.ch
ssh -Y $username@merlin-l-002.psi.ch
```
* To test that X11 forwarding works, just run ``xclock``. An X11-based clock should
pop up in your client session:
```bash
xclock
```
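Instead of enabling these options globally, they can also be restricted to the Merlin login nodes with a ``Host`` stanza in ``~/.ssh/config`` (a sketch; the host pattern is an assumption):

```bash
Host merlin-l-*.psi.ch
    ForwardAgent yes
    ForwardX11Trusted yes
```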

---
title: Connecting from a MacOS Client
#tags:
keywords: MacOS, connecting, client, configuration, SSH, X11
last_updated: 23 Oct 2019
summary: "This document describes a recommended setup for a MacOS client."
sidebar: merlin6_sidebar
permalink: /merlin6/connect-from-macos.html
---
## SSH without X11 Forwarding
This is the standard method. Official X11 support is provided through [NoMachine](/merlin6/nomachine.html).
For normal SSH sessions, use your SSH client as follows:
```bash
ssh $username@merlin-l-01.psi.ch
ssh $username@merlin-l-001.psi.ch
ssh $username@merlin-l-002.psi.ch
```
## SSH with X11 Forwarding
### Requirements
To run SSH with X11 forwarding on MacOS, you need an X server running on MacOS.
The official X server for MacOS is **[XQuartz](https://www.xquartz.org/)**. Please ensure
it is running before starting an SSH connection with X11 forwarding.
### SSH with X11 Forwarding in MacOS
Official X11 support is through NoMachine. Please follow the document
[{Job Submission -> Interactive Jobs}](/merlin6/interactive-jobs.html#Requirements) and
[{Accessing Merlin -> NoMachine}](/merlin6/nomachine.html) for more details. However,
we provide a small recipe for enabling X11 Forwarding in MacOS.
* Ensure that **[XQuartz](https://www.xquartz.org/)** is installed and running in your MacOS.
* For enabling client X11 forwarding, add the following to the start of ``~/.ssh/config``
to implicitly add ``-Y`` to all ssh connections:
```bash
ForwardAgent yes
ForwardX11Trusted yes
```
* Alternatively, you can add the option ``-Y`` to the ``ssh`` command. For example:
```bash
ssh -Y $username@merlin-l-01.psi.ch
ssh -Y $username@merlin-l-001.psi.ch
ssh -Y $username@merlin-l-002.psi.ch
```
* To test that X11 forwarding works, just run ``xclock``. An X11-based clock should
pop up in your client session.
```bash
xclock
```

---
title: Connecting from a Windows Client
#tags:
keywords: Windows, connecting, client, configuration, SSH, X11
last_updated: 23 Oct 2019
summary: "This document describes a recommended setup for a Windows client."
sidebar: merlin6_sidebar
permalink: /merlin6/connect-from-windows.html
---
## SSH with PuTTY without X11 Forwarding
PuTTY is one of the most common tools for SSH.
Check whether the following software packages are installed on the Windows workstation by
inspecting the *Start* menu (hint: use the *Search* box to save time):
* PuTTY (should be already installed)
* *[Optional]* Xming (needed for [SSH with X11 Forwarding](/merlin6/connect-from-windows.html#ssh-with-x11-forwarding))
If they are missing, you can install them using the Software Kiosk icon on the Desktop.
1. Start PuTTY
2. *[Optional]* Enable ``xterm`` to get mouse behaviour similar to Linux:
![Enable 'xterm']({{ "/images/PuTTY/Putty_Mouse_XTerm.png" }})
3. Create session to a Merlin login node and *Open*:
![Create Merlin Session]({{ "/images/PuTTY/Putty_Session.png" }})
## SSH with PuTTY with X11 Forwarding
Official X11 Forwarding support is through NoMachine. Please follow the document
[{Job Submission -> Interactive Jobs}](/merlin6/interactive-jobs.html#Requirements) and
[{Accessing Merlin -> NoMachine}](/merlin6/nomachine.html) for more details. However,
we provide a small recipe for enabling X11 Forwarding in Windows.
Check whether **Xming** is installed on the Windows workstation by inspecting the
*Start* menu (hint: use the *Search* box to save time). If missing, you can install it
using the Software Kiosk icon (located on the Desktop).
1. Ensure that an X server (**Xming**) is running. Otherwise, start it.
2. Enable X11 Forwarding in your SSH client. For example, in PuTTY:
![Enable X11 Forwarding in Putty]({{ "/images/PuTTY/Putty_X11_Forwarding.png" }})

---
title: Remote Desktop Access
#tags:
#keywords:
last_updated: 19 Aug 2019
#summary: ""
sidebar: merlin6_sidebar
permalink: /merlin6/nomachine.html
---
Users can log in to Merlin through a Linux Remote Desktop session. NoMachine
is a desktop virtualization tool, similar to VNC, Remote Desktop, etc.
It uses the NX protocol to enable a graphical login to remote servers.
## Installation
NoMachine is available for PSI Windows computers in the Software Kiosk under the
name **NX Client**. Please use the latest version (at least 6.0). For MacOS and
Linux, the NoMachine client can be downloaded from https://www.nomachine.com/.
## Accessing Merlin6 NoMachine from PSI
The Merlin6 NoMachine service is located **only** in the following login nodes:
* `merlin-l-001.psi.ch`
* `merlin-l-002.psi.ch`
The steps necessary for configuring access to the NoMachine service running on a
login node are explained below.
### Creating a Merlin6 NoMachine connection
#### Creating a **New** connection
Click on the **New** button to create a new connection:
![Create New NoMachine Connection]({{ "/images/NoMachine/screen_nx1.png" }})
#### Configuring **NX** protocol
![Select NoMachine Protocol]({{ "/images/NoMachine/screen_nx2.png" }})
#### Configuring NoMachine Server Information
Select the corresponding login node server where the NoMachine service is running and
keep **4000** as the listening port; then **Continue**.
![Configure NoMachine Server Information]({{ "/images/NoMachine/screen_nx3.png" }})
#### Configuring NoMachine Authentication Method
Choose your authentication method and **Continue**. **Password** or *Kerberos* are the recommended ones:
![Configure NoMachine Authentication Method]({{ "/images/NoMachine/screen_nx4.png" }})
#### Configuring Proxy
In Merlin6, we will check **Don't use a proxy**, and **Continue**:
![Configure NoMachine Proxy Information]({{ "/images/NoMachine/screen_nx5.png" }})
#### Configuring Connection Name
We strongly recommend to add the login node hostname, but
you are free to choose any other name for your connection:
![Configure Connection Name]({{ "/images/NoMachine/screen_nx6.png" }})
### Connecting to Merlin6 NoMachine
#### Opening an existing NoMachine connection
Double click on the NoMachine server in order to connect to it:
![Connect to an existing NoMachine connection]({{ "/images/NoMachine/screen_nx7.png" }})
#### Authenticating (whenever necessary)
If authentication is required, you will be asked for it. The example below corresponds to **Password**
authentication:
![NoMachine Authentication]({{ "/images/NoMachine/screen_nx8.png" }})
#### Creating/Re-Connecting Virtual Desktops
Finally, create a virtual desktop in order to get in. If a previous virtual desktop was created, you
might be able to re-attach to that session.
![Create or Connect to a NoMachine Virtual Session]({{ "/images/NoMachine/screen_nx9.png" }})
Some hints on how to manage the resolution and windows will be shown.
![NoMachine Resolution/Window Management Hints]({{ "/images/NoMachine/screen_nx10.png" }})
### NoMachine LightDM Session Example
An example of a NoMachine session, based on the [LightDM](https://github.com/canonical/lightdm)
display manager:
![NoMachine Session: LightDM Desktop]({{ "/images/NoMachine/screen_nx11.png" }})
## Accessing Merlin6 NoMachine from outside PSI
### No VPN access
Access to the Merlin6 NoMachine service is possible without VPN through **'rem-acc.psi.ch'**.
Please follow the steps described in [PSI Remote Interactive Access](https://www.psi.ch/en/photon-science-data-services/remote-interactive-access) for
remote access to the Merlin6 NoMachine services. Once logged in to **'rem-acc.psi.ch'**, you must then log in to one of the available Merlin6 NoMachine
services.
### VPN access
Remote access is also possible through VPN; however, you **must not use 'rem-acc.psi.ch'**, and you have to connect directly
to the Merlin6 NoMachine services as if you were inside PSI. VPN access should be requested from the IT department by
opening a PSI Service Now ticket:
[VPN Access (PSI employees)](https://psi.service-now.com/psisp?id=psi_new_sc_cat_item&sys_id=beccc01b6f44a200d02a82eeae3ee440).

---
title: Configuring SSH Keys in Merlin
#tags:
keywords: Linux, connecting, client, configuration, SSH, Keys, SSH-Keys, RSA
last_updated: 15 Jul 2020
summary: "This document describes how to deploy SSH Keys in Merlin."
sidebar: merlin6_sidebar
permalink: /merlin6/ssh-keys.html
---
Merlin users sometimes need to access the different Merlin services without being repeatedly prompted for a password.
This can be achieved with Kerberos authentication; however, in some cases software requires the setup of SSH keys.
One example is ANSYS Fluent: when used interactively, the GUI communicates with the different nodes
through the SSH protocol, and the use of SSH keys is enforced.
## Setting up SSH Keys on Merlin
For security reasons, users **must always protect SSH keys with a passphrase**.
Users can check whether an SSH key already exists. Keys are placed in the **~/.ssh/** directory; `RSA` encryption
is usually the default, with the files **`id_rsa`** (private key) and **`id_rsa.pub`** (public key).
```bash
ls ~/.ssh/id*
```
For creating **SSH RSA Keys**, one should:
1. Run `ssh-keygen`; a passphrase will be requested twice. You **must remember** this passphrase for the future.
   * For security reasons, ***always set a passphrase***. Never leave it empty.
* This will generate a private key **id_rsa**, and a public key **id_rsa.pub** in your **~/.ssh** directory.
2. Add your public key to the **`authorized_keys`** file, and ensure proper permissions for that file, as follows:
```bash
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 0600 ~/.ssh/authorized_keys
```
3. Configure the SSH client in order to force the usage of the **psi.ch** domain for trusting keys:
```bash
echo "CanonicalizeHostname yes" >> ~/.ssh/config
```
## Using the SSH Keys
### Using Authentication Agent in SSH session
By default, when accessing the login node via SSH (with `ForwardAgent=yes`), your
SSH keys are automatically added to the authentication agent, so no action should be needed by the user. One can configure
`ForwardAgent=yes` as follows:
* **(Recommended)** In your local Linux (workstation, laptop or desktop) add the following line in the
`$HOME/.ssh/config` (or alternatively in `/etc/ssh/ssh_config`) file:
```
ForwardAgent yes
```
* Alternatively, you can add the option `ForwardAgent=yes` to each SSH command. For example:
```bash
ssh -XY -o ForwardAgent=yes merlin-l-001.psi.ch
```
If `ForwardAgent` is not enabled as shown above, one needs to run the authentication agent and then add your key
to the **ssh-agent**. This must be done once per SSH session, as follows:
* Run `eval $(ssh-agent -s)` to run the **ssh-agent** in that SSH session
* Check whether the authentication agent has your key already added:
```bash
ssh-add -l | grep "/psi/home/$(whoami)/.ssh"
```
* If no key is returned in the previous step, you have to add the private key identity to the authentication agent.
You will be requested for the **passphrase** of your key, and it can be done by running:
```bash
ssh-add
```
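The per-session sequence above can be sketched in two lines (an assumption: the output of `ssh-add -l` contains the key path, as in the `grep` shown above):

```shell
# Start an agent for this session, then add the key only if it is not
# already loaded (ssh-add prompts for the passphrase).
eval "$(ssh-agent -s)"
ssh-add -l | grep -q "$HOME/.ssh" || ssh-add
```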
### Using Authentication Agent in NoMachine Session
By default, when using a NoMachine session, the `ssh-agent` should be started automatically, so there is no need to
start the agent or forward it.
However, for NoMachine one always needs to add the private key identity to the authentication agent. This can be done as follows:
1. Check whether the authentication agent has already the key added:
```bash
ssh-add -l | grep "/psi/home/$(whoami)/.ssh"
```
2. If no key is returned in the previous step, you have to add the private key identity to the authentication agent.
You will be requested for the **passphrase** of your key, and it can be done by running:
```bash
ssh-add
```
You just need to run it once per NoMachine session, and it applies to all terminal windows within that NoMachine session.
## Troubleshooting
### Errors when running 'ssh-add'
If the error `Could not open a connection to your authentication agent.` appears when running `ssh-add`, it means
that the authentication agent is not running. Please follow the previous procedures for starting it.
### Add/Update SSH RSA Key password
If an existing SSH key has no passphrase, or you want to update the existing passphrase, you can do it as follows:
```bash
ssh-keygen -p -f ~/.ssh/id_rsa
```
### SSH Keys deployed but not working
Please ensure proper permissions on the involved files, and check for typos in the file names:
```bash
chmod u+rwx,go-rwx,g+s ~/.ssh
chmod u+rw-x,go-rwx ~/.ssh/authorized_keys
chmod u+rw-x,go-rwx ~/.ssh/id_rsa
chmod u+rw-x,go+r-wx ~/.ssh/id_rsa.pub
```
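To verify the result, one can print the octal modes (a sketch using GNU ``stat``; the expected values follow from the commands above):

```shell
# Expected: ~/.ssh -> 2700 (700 plus setgid), authorized_keys and id_rsa -> 600,
# id_rsa.pub -> 644
stat -c '%a %n' ~/.ssh ~/.ssh/authorized_keys ~/.ssh/id_rsa ~/.ssh/id_rsa.pub
```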
### Testing SSH Keys
Once an SSH key has been created, you can test that it is valid as follows:
1. Create a **new** SSH session in one of the login nodes:
```bash
ssh merlin-l-001
```
2. In the login node session, destroy any existing Kerberos ticket or active SSH Key:
```bash
kdestroy
ssh-add -D
```
3. Add the new private key identity to the authentication agent. You will be asked for the passphrase.
```bash
ssh-add
```
4. Check that your key is active in the SSH agent:
```bash
ssh-add -l
```
5. SSH to the second login node. No password should be requested:
```bash
ssh -vvv merlin-l-002
```
If the last step succeeds, your SSH key is properly set up.

---
title: Merlin6 Storage
#tags:
#keywords:
last_updated: 28 June 2019
#summary: ""
sidebar: merlin6_sidebar
redirect_from: /merlin6/data-directories.html
permalink: /merlin6/storage.html
---
## Introduction
This document describes the different directories of the Merlin6 cluster.
### Checking user quota
For each directory class we provide a way of checking quotas (when applicable). In addition, the single command ``merlin_quotas``
shows the quotas for all of your filesystems at once (including AFS, which is not covered here).
To check your quotas, please run:
```bash
merlin_quotas
```
## Merlin6 directories
Merlin6 offers the following directory classes for users:
* ``/psi/home/<username>``: Private user **home** directory
* ``/data/user/<username>``: Private user **data** directory
* ``/data/project/general/<projectname>``: Shared **Project** directory
* For BIO experiments, a dedicated ``/data/project/bio/$projectname`` exists.
* ``/scratch``: Local *scratch* disk (only visible by the node running a job).
* ``/shared-scratch``: Shared *scratch* disk (visible from all nodes).
* ``/export``: Export directory for data transfer, visible from `ra-merlin-01.psi.ch`, `ra-merlin-02.psi.ch` and Merlin login nodes.
* Refer to **[Transferring Data](/merlin6/transfer-data.html)** for more information about the export area and data transfer service.
{{site.data.alerts.tip}}GPFS has a concept called <b>GraceTime</b>. Filesystems have a block (amount of data) and a file (number of files) quota.
Each quota has a soft and a hard limit. Once the soft limit is exceeded, users can keep writing up to the hard limit during the <b>grace period</b>.
Once the <b>GraceTime</b> or the hard limit is reached, users will be unable to write and will need to remove data to get below the soft limit (or ask
for a quota increase where possible, see the table below).
{{site.data.alerts.end}}
Properties of the directory classes:
| Directory | Block Quota [Soft:Hard] | File Quota [Soft:Hard] | GraceTime | Quota Change Policy: Block | Quota Change Policy: Files | Backup | Backup Policy |
| ---------------------------------- | ----------------------- | ----------------------- | :-------: | :--------------------------------- |:-------------------------------- | ------ | :----------------------------- |
| /psi/home/$username | USR [10GB:11GB] | *Undef* | N/A | Up to x2 when strongly justified. | N/A | yes | Daily snapshots for 1 week |
| /data/user/$username | USR [1TB:1.074TB] | USR [1M:1.1M] | 7d | Immutable. Need a project. | Changeable when justified. | no | Users responsible for backup |
| /data/project/bio/$projectname | GRP [1TB:1.074TB] | GRP [1M:1.1M] | 7d | Subject to project requirements. | Subject to project requirements. | no | Project responsible for backup |
| /data/project/general/$projectname | GRP [1TB:1.074TB] | GRP [1M:1.1M] | 7d | Subject to project requirements. | Subject to project requirements. | no | Project responsible for backup |
| /scratch | *Undef* | *Undef* | N/A | N/A | N/A | no | N/A |
| /shared-scratch | USR [512GB:2TB] | USR [2M:2.5M] | 7d | Up to x2 when strongly justified. | Changeable when justified. | no | N/A |
| /export | USR [10MB:20TB] | USR [512K:5M] | 10d | Soft limit can be temporarily increased. | Changeable when justified. | no | N/A |
{{site.data.alerts.warning}}The use of <b>scratch</b> and <b>export</b> areas as an extension of the quota <i>is forbidden</i>. <b>scratch</b> and <b>export</b> areas <i>must not contain</i> final data.
<br><b><i>Auto cleanup policies</i></b> in the <b>scratch</b> and <b>export</b> areas are applied.
{{site.data.alerts.end}}
### User home directory
This is the default directory users land in when logging in to any Merlin6 machine.
It is intended for scripts, documents, software development, and other files that
you want backed up. Do not use it for data or I/O-intensive HPC tasks.
This directory is mounted in the login and computing nodes under the path:
```bash
/psi/home/$username
```
Home directories are part of the PSI NFS Central Home storage provided by AIT and
are managed by the Merlin6 administrators.
Users can check their quota by running the following command:
```bash
quota -s
```
#### Home directory policy
* Read the **Important: Code of Conduct** document for more information about Merlin6 policies.
* It is **forbidden** to use the home directories for I/O intensive tasks.
  * Use ``/scratch``, ``/shared-scratch``, ``/data/user`` or ``/data/project`` for this purpose.
* Users can recover up to 1 week of lost data thanks to automatic **daily snapshots kept for 1 week**.
Snapshots can be accessed at this path:
```bash
/psi/home/.snapshot/$username
```
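For instance, restoring a single file from a snapshot is a plain copy. The sketch below wraps it in a small hypothetical helper; the file name `Documents/notes.txt` is only an illustrative example:

```shell
#!/bin/bash
# Sketch: restore one file from a read-only snapshot tree into a writable tree.
restore_from_snapshot() {
    local snap_root=$1 rel_path=$2 dest_root=$3
    mkdir -p "$(dirname "$dest_root/$rel_path")"   # recreate parent directories
    cp -a "$snap_root/$rel_path" "$dest_root/$rel_path"
}

# On Merlin this would typically look like:
#   restore_from_snapshot /psi/home/.snapshot/$USER Documents/notes.txt /psi/home/$USER
```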
### User data directory
The user data directory is intended for *fast I/O access* and for keeping large amounts of private data.
This directory is mounted on the login and computing nodes under:
```bash
/data/user/$username
```
Users can check their quota by running the following command:
```bash
mmlsquota -u <username> --block-size auto merlin-user
```
#### User data directory policy
* Read the **Important: Code of Conduct** document for more information about Merlin6 policies.
* It is **forbidden** to use the data directories as a ``scratch`` area during a job's runtime.
  * Use ``/scratch`` or ``/shared-scratch`` for this purpose.
* No backup policy is applied for user data directories: users are responsible for backing up their data.
### Project data directory
This storage is intended for *fast I/O access* and for keeping large amounts of a project's data, which can also be
shared by all members of the project (the project's corresponding Unix group). We recommend keeping most data in
project-related storage spaces, since this allows users to coordinate. Project spaces also have more flexible policies
for extending the available storage space.
Experiments can request a project space as described in **[[Accessing Merlin -> Requesting a Project]](/merlin6/request-project.html)**
Once created, the project data directory will be mounted on the login and computing nodes under the directory:
```bash
/data/project/general/$projectname
```
Project quotas are defined on a per *group* basis. Users can check the project quota by running the following command:
```bash
mmlsquota -j $projectname --block-size auto -C merlin.psi.ch merlin-proj
```
#### Project Directory policy
* Read the **Important: Code of Conduct** document for more information about Merlin6 policies.
* It is **forbidden** to use the data directories as ``scratch`` area during a job's runtime, i.e. for high throughput I/O for a job's temporary files. Please Use ``/scratch``, ``/shared-scratch`` for this purpose.
* No backups: users are responsible for managing the backups of their data directories.
### Scratch directories
There are two different types of scratch storage: **local** (``/scratch``) and **shared** (``/shared-scratch``).
**local** scratch should be used for all jobs that do not require the scratch files to be accessible from multiple nodes, which is trivially
true for all jobs running on a single node.
**shared** scratch is intended for files that need to be accessible by multiple nodes, e.g. by a MPI-job where tasks are spread out over the cluster
and all tasks need to do I/O on the same temporary files.
**local** scratch in Merlin6 computing nodes provides a huge number of IOPS thanks to the NVMe technology. **Shared** scratch is implemented using a distributed parallel filesystem (GPFS) resulting in a higher latency, since it involves remote storage resources and more complex I/O coordination.
``/shared-scratch`` is only mounted in the *Merlin6* computing nodes (i.e. not on the login nodes), and its current size is 50TB. This can be increased in the future.
The properties of the available scratch storage spaces are given in the following table
| Cluster | Service | Scratch | Scratch Mountpoint | Shared Scratch | Shared Scratch Mountpoint | Comments |
| ------- | -------------- | ------------ | ------------------ | -------------- | ------------------------- | -------------------------------------- |
| merlin5 | computing node | 50GB / SAS | ``/scratch`` | ``N/A`` | ``N/A`` | ``merlin-c-[01-64]`` |
| merlin6 | login node | 100GB / SAS | ``/scratch`` | 50TB / GPFS | ``/shared-scratch`` | ``merlin-l-0[1,2]`` |
| merlin6 | computing node | 1.3TB / NVMe | ``/scratch`` | 50TB / GPFS | ``/shared-scratch`` | ``merlin-c-[001-024,101-124,201-224]`` |
| merlin6 | login node | 2.0TB / NVMe | ``/scratch`` | 50TB / GPFS | ``/shared-scratch`` | ``merlin-l-00[1,2]`` |
#### Scratch directories policy
* Read the **Important: Code of Conduct** document for more information about Merlin6 policies.
* By default, *always* use **local** first and only use **shared** if your specific use case requires it.
* Temporary files *must be deleted at the end of the job by the user*.
* Remaining files will be deleted by the system if detected.
* Files not accessed within 28 days will be automatically cleaned up by the system.
* If for some reason the scratch areas get full, admins have the right to clean up the oldest data.
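One common pattern that satisfies the cleanup requirement is to create a per-job scratch directory and remove it with a shell trap when the job script exits. The following is a minimal sketch, not a mandated template; the Slurm variable `SLURM_JOB_ID` is only set inside a job, so the sketch falls back to `mktemp` to also run standalone:

```shell
#!/bin/bash
# Per-job scratch directory with guaranteed cleanup on exit.
# Inside a Slurm job, use /scratch/$USER/$SLURM_JOB_ID; otherwise fall back
# to a temporary directory so this sketch can run anywhere.
if [ -n "$SLURM_JOB_ID" ] && [ -d /scratch ]; then
    SCRATCHDIR="/scratch/$USER/$SLURM_JOB_ID"
    mkdir -p "$SCRATCHDIR"
else
    SCRATCHDIR=$(mktemp -d)
fi
trap 'rm -rf "$SCRATCHDIR"' EXIT   # cleanup runs even if the job fails

# ... the job's I/O-heavy work writes its temporary files here ...
echo "temporary data" > "$SCRATCHDIR/tmpfile"
echo "scratch dir in use: $SCRATCHDIR"
```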
### Export directory
The export directory is exclusively intended for transferring data between Merlin and locations outside PSI. It is a temporary directory with an auto-cleanup policy.
Please read **[Transferring Data](/merlin6/transfer-data.html)** for more information about it.
#### Export directory policy
* Temporary files *must be deleted at the end of the job by the user*.
* Remaining files will be deleted by the system if detected.
* Files not accessed within 28 days will be automatically cleaned up by the system.
* If for some reason the export area gets full, admins have the right to clean up the oldest data.
---
---
title: Transferring Data
#tags:
#keywords:
last_updated: 9 July 2019
#summary: ""
sidebar: merlin6_sidebar
permalink: /merlin6/transfer-data.html
---
## Transferring Data from the PSI Network to/from Merlin6
### Rsync
Rsync is the preferred method to transfer data from Linux/MacOS. It allows
transfers to be easily resumed if they get interrupted. The general syntax is:
```
rsync -avAHXS <src> <dst>
```
For example, to transfer files from your local computer to a merlin project
directory:
```
rsync -avAHXS ~/localdata user@merlin-l-01.psi.ch:/data/project/general/myproject/
```
You can resume interrupted transfers by simply rerunning the command. Previously
transferred files will be skipped.
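The resume behaviour can be demonstrated with a purely local transfer (local paths are used here for illustration; on Merlin the destination would be a `user@merlin-l-...:` path). Adding `--partial` also keeps partially transferred files, so interrupted transfers of large files can continue mid-file:

```shell
# Local demonstration of rsync's skip/resume behaviour.
mkdir -p /tmp/rsync-demo/src /tmp/rsync-demo/dst
echo "example data" > /tmp/rsync-demo/src/file1

# First run copies file1; -a preserves attributes, --partial keeps partial files.
rsync -a --partial /tmp/rsync-demo/src/ /tmp/rsync-demo/dst/

# Rerunning the same command transfers nothing: file1 is already up to date.
rsync -a --partial /tmp/rsync-demo/src/ /tmp/rsync-demo/dst/
```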
### WinSCP
The WinSCP tool can be used for remote file transfer on Windows. It is available
from the Software Kiosk on PSI machines. Add `merlin-l-01.psi.ch` as a host and
connect with your PSI credentials. You can then drag-and-drop files between your
local computer and merlin.
## Transferring Data to/from outside PSI
Two servers are enabled for transferring data between Merlin and the outside of PSI.
These Remote Access Merlin servers are the following:
* **'ra-merlin-01.psi.ch'**: standard password authentication (with PSI password)
* `/data/user` mounted in RO (read-only)
* `/export` directory in RW (read-write). `/export` is also visible from login nodes.
* **'ra-merlin-02.psi.ch'**: ***two-factor authentication*** (2FA); requires an **RSA SecurID** token (the same as for VPN)
* `/data/project` directories mounted in RW on demand. Project responsibles must request it.
* `/data/user` mounted in RW (read-write)
* `/export` directory in RW (read-write). `/export` is also visible from login nodes.
In the future, **'ra-merlin-01.psi.ch'** will also be configured with 2FA and will mount the same directories
as **'ra-merlin-02.psi.ch'**. In the meantime, **'ra-merlin-01.psi.ch'** keeps standard authentication
until we can ensure that most Merlin users have an RSA SecurID token, or until the PSI security policy makes
its use mandatory. Using **'ra-merlin-02.psi.ch'** over **'ra-merlin-01.psi.ch'** is always recommended, as 2FA
is more secure than standard authentication.
### Directories
#### /data/user
User data directories are mounted in RO on 'ra-merlin-01', and RW on 'ra-merlin-02'.
{{site.data.alerts.warning}}Please <b>ensure properly secured permissions</b> on your '/data/user'
directory. By default, when the directory is created, the system applies the most restrictive
permissions. However, this does not prevent users from changing the permissions afterwards. At that
point, users become responsible for those changes.
{{site.data.alerts.end}}
#### /export
Transferring large amounts of data from outside PSI to Merlin is always possible through `/export`.
{{site.data.alerts.tip}}<b>The '/export' directory can be used by any Merlin user.</b>
This is configured in Read/Write mode. If you need access, please, contact the Merlin administrators.
{{site.data.alerts.end}}
{{site.data.alerts.warning}}The use of <b>export</b> as an extension of the quota <i>is forbidden</i>.
<br><b><i>Auto cleanup policies</i></b> in the <b>export</b> area apply for files older than 28 days.
{{site.data.alerts.end}}
##### Exporting data from Merlin
For exporting data from Merlin to outside PSI by using `/export`, one has to:
* From a Merlin login node, copy your data from any directory (i.e. `/data/project`, `/data/user`, `/scratch`) to
`/export`. Ensure to properly secure your directories and files with proper permissions.
* Once data is copied, from **ra-merlin-01.psi.ch** or **ra-merlin-02.psi.ch**, copy the data from `/export` to outside PSI.
##### Importing data to Merlin
For importing data from outside PSI to Merlin by using `/export`, one has to:
* From **ra-merlin-01.psi.ch** or **ra-merlin-02.psi.ch**, copy the data from outside PSI to `/export`.
Ensure to properly secure your directories and files with proper permissions.
* Once data is copied, from a Merlin login node, copy your data from `/export` to any directory (i.e. `/data/project`, `/data/user`, `/scratch`).
#### /data/project
Optionally, instead of using `/export`, experiments with a Merlin project can request Read/Write or Read/Only access to their project directory.
{{site.data.alerts.tip}}<b>Merlin projects can request direct access on 'ra-merlin-02.psi.ch'</b>
This can be configured in Read/Write or Read/Only modes. If your project needs access, please,
contact the Merlin administrators.
{{site.data.alerts.end}}
### Accepted protocols
Accepted protocols for Remote Access Merlin servers are the following:
* **sftp**: **``sftp``** command or similar X11/Windows/MacOS based programs.
* **ssh**: **`scp`** command (as well as **WinSCP** and similar programs) or **`rsync`** command
* **~~Globus Online~~**: ***not available yet.***
### Remote Access Servers Policies
Although SSH is one of the allowed protocols:
* Please **never** use these servers as login nodes.
* Please avoid copying files to the *home* directories.
* Please **never use SSH keys** for accessing these servers. Access through SSH keys will be denied in the upcoming months.
Only the ``/data/user``, ``/data/project`` and ``/export`` directories should be used on these nodes,
and exclusively for transferring data between PSI and the outside.
## Connecting to Merlin6 from outside PSI
Merlin6 is fully accessible from within the PSI network. To connect from outside you can use:
- [VPN](https://www.psi.ch/en/computing/vpn) ([alternate instructions](https://intranet.psi.ch/BIO/ComputingVPN))
- [SSH hop](https://www.psi.ch/en/computing/ssh-hop)
* Please avoid transferring large amounts of data through **hop**
- [No Machine](nomachine.md)
* Remote Interactive Access through [**'rem-acc.psi.ch'**](https://www.psi.ch/en/photon-science-data-services/remote-interactive-access)
* Please avoid transferring large amounts of data through **NoMachine**
---
title: Using PModules
#tags:
#keywords:
last_updated: 20 June 2019
#summary: ""
sidebar: merlin6_sidebar
permalink: /merlin6/using-modules.html
---
## Environment Modules
On top of the operating system stack we provide different software using the PSI-developed PModules system.
PModules is the officially supported way of providing software, and each package is deployed by a designated expert. PModules
mostly contains software that is used by many people.
If you miss a package, a version, or a specific feature, contact us and we will evaluate whether it is feasible to install it.
### Basic commands
The basic commands are:
```bash
module avail # to see the list of available software packages provided via pmodules
module use unstable # to get access to a set of packages not fully tested by the community
module load <package>/<version> # to load specific software package with a specific version
module search <string> # to search for a specific software package and its dependencies.
module list # to list which software is loaded in your environment
module purge # unload all loaded packages and cleanup the environment
```
Also, you can load multiple packages at once. This can be useful for instance when loading a package with its dependencies:
```bash
# Single line
module load gcc/9.2.0 openmpi/3.1.5-1_merlin6
# Multiple lines
module load gcc/9.2.0
module load openmpi/3.1.5-1_merlin6
```
In the example above we load ``openmpi/3.1.5-1_merlin6``, but we also specify ``gcc/9.2.0``, which is a strict dependency. The
dependency must be loaded first.
---
## When to request new PModules packages
### Missing software
If you do not find a specific software package but know that other people are interested in it, it can be installed in PModules. Please contact us
and we will try to help with that. Deploying new software in PModules may take a few days.
Installation of new software is usually possible as long as at least a few users will use it. If you are interested in maintaining the software,
please let us know.
### Missing version
If the existing PModules versions of a specific package do not fit your needs, it is possible to ask for a new version.
Installation of newer versions is usually supported, as long as a few users will use it. Installation of intermediate versions can
be supported when strictly justified.