Workspaces/Advanced Features: Difference between revisions

From bwHPC Wiki
Jump to navigation Jump to search
mNo edit summary
Line 88: Line 88:
Use <tt>ws_list -l</tt> or <tt>ws_find -l</tt> to see available filesystem locations on your cluster.
Use <tt>ws_list -l</tt> or <tt>ws_find -l</tt> to see available filesystem locations on your cluster.


== Detailed ws_allocate Options ==
== ws_allocate - Create and Extend Workspaces ==


{| class="wikitable"
{| class="wikitable"
Line 180: Line 180:
For more information read the program's help: <tt>$ ws_allocate -h</tt> or <tt>man ws_allocate</tt>
For more information read the program's help: <tt>$ ws_allocate -h</tt> or <tt>man ws_allocate</tt>


== Advanced ws_list Options ==
== ws_list - List Workspaces ==


Beyond the basic options shown in the main [[Workspaces]] guide, <tt>ws_list</tt> supports additional sorting and filtering:
Beyond the basic options shown in the main [[Workspaces]] guide, <tt>ws_list</tt> supports additional sorting and filtering:
Line 199: Line 199:
This shows all workspaces that were created with <tt>-g</tt> or <tt>-G</tt> flags by members of your group.
This shows all workspaces that were created with <tt>-g</tt> or <tt>-G</tt> flags by members of your group.


To list expired workspaces, see [[#Restore_an_Expired_Workspace|Restore an Expired Workspace]].
'''Note:''' To list expired workspaces that can be restored, use <tt>ws_restore -l</tt>. See [[#ws_restore_-_Restore_Expired_Workspace|ws_restore]] for details.


For more information: <tt>$ ws_list -h</tt> or <tt>man ws_list</tt>
For more information: <tt>$ ws_list -h</tt> or <tt>man ws_list</tt>


== Advanced ws_find Options ==
== ws_find - Find Workspace Path ==


<tt>ws_find</tt> can search workspaces on specific filesystems:
<tt>ws_find</tt> can search workspaces on specific filesystems:
Line 210: Line 210:
* <tt>-l</tt> - List valid filesystem names
* <tt>-l</tt> - List valid filesystem names


== Advanced ws_extend Options ==
== ws_extend - Extend Workspace Lifetime ==


Beyond the basic extension shown in the main [[Workspaces]] guide:
Beyond the basic extension shown in the main [[Workspaces]] guide:
Line 222: Line 222:
'''Update reminder only''' (without extending):
'''Update reminder only''' (without extending):


$ ws_allocate -r <days> -x <workspace> 0 # Update reminder time
$ ws_allocate -r <days> -x <workspace> 0 # Update reminder time
$ ws_allocate -r <days> -u <username> -x <workspace> 0 # Update reminder time and take over
$ ws_allocate -r <days> -u <username> -x <workspace> 0 # Update reminder time and take over
# another user's workspace reminders
# another user's workspace reminders


== Getting Reminders ==
== Getting Reminders ==
Line 261: Line 261:
$ ws_send_ical <workspace> <email>
$ ws_send_ical <workspace> <email>


'''User defaults''' in <tt>~/.ws_user.conf</tt> (YAML format):
=== ~/.ws_user.conf Configuration File ===

<pre>
mail: reach@me.here # Optional - only if you want to override the email from identity provider
duration: 10 # Default workspace lifetime
reminder: 3 # Days before expiration to send reminder
groupname: mygroup # Default group for -G option
</pre>

=== Example ~/.ws_user.conf Configuration ===


{| class="wikitable"
{| class="wikitable"
Line 289: Line 280:
|}
|}


You can set defaults in <tt>~/.ws_user.conf</tt> (YAML format) to avoid typing the same options repeatedly:
Here's a complete example configuration file that sets useful defaults:

'''Important:''' Some versions of the workspace tools mistakenly interpret a leading comment line in <tt>~/.ws_user.conf</tt> as an email address. Make sure the '''first line does not start with <tt>#</tt>'''. Start with a setting like <tt>duration: 30</tt>; inline end-of-line comments are fine.


<pre>
<pre>
duration: 30 # Default workspace lifetime (first line must not start with #)
duration: 30 # Default workspace lifetime (first line must not start with #!)
reminder: 5 # Days before expiration to send reminder
# ~/.ws_user.conf - Workspace tool defaults
groupname: projectgroup # Default group for -G option (e.g., bw11a000)
# This file uses YAML syntax
# mail: custom@example.com # Optional - only to override email from identity provider

# Email address for expiration reminders (optional)
# If not set, email from your identity provider is used automatically
# Only uncomment this line if you need to use a different email address
# mail: custom.email@example.com

# Reminder timing in days before expiration
# You will receive an email this many days before workspace expires
# Recommended: 3-7 days to give yourself time to extend or backup data
reminder: 5

# Default group name for collaborative workspaces
# If you always work with the same group, set this to avoid typing -G groupname
# Uncomment and set your group name below:
# groupname: bw11a000
</pre>
</pre>


'''Important:''' Some versions mistakenly interpret a leading <tt>#</tt> comment as email. '''First line must be a setting''', not a comment. Inline comments are fine.
'''To create this file:'''

$ nano ~/.ws_user.conf

Then paste the configuration above and adjust the values to your needs.


'''Benefits of using ~/.ws_user.conf:'''
'''Benefits:'''
* Simplifies commands: <tt>ws_allocate myWs</tt> instead of <tt>ws_allocate -r 5 -G groupname myWs 30</tt>
* Avoid typing the same options repeatedly
* Ensures consistent settings across all workspace operations
* Ensures consistent settings across all operations
* Automatic group collaboration when <tt>groupname</tt> is set
* Simplifies commands: <tt>ws_allocate myWs</tt> instead of <tt>ws_allocate -r 5 myWs 30</tt>
* Automatic group collaboration setup when <tt>groupname</tt> is set


== Cooperative Usage (Group Workspaces and Sharing) ==
== Cooperative Usage (Group Workspaces and Sharing) ==
Line 591: Line 561:
Use manual <tt>chmod</tt>/<tt>chgrp</tt> only when <tt>-G</tt> is not available on your cluster or for fixing permissions on existing data.
Use manual <tt>chmod</tt>/<tt>chgrp</tt> only when <tt>-G</tt> is not available on your cluster or for fixing permissions on existing data.


== Delete a Workspace ==
== ws_release - Release (Delete) Workspace ==


To release a workspace, use:
To release a workspace, use:
Line 601: Line 571:
'''What happens when you release:'''
'''What happens when you release:'''
* The workspace ID can be reused immediately
* The workspace ID can be reused immediately
* The directory becomes inaccessible
* The directory becomes inaccessible to the user
* The data is '''not deleted immediately''' - it is kept for a grace period
* The data is '''not deleted immediately''' - it is moved to a deleted area
* The data can be recovered using <tt>ws_restore</tt> during the grace period
* '''Released workspaces:''' Can be recovered using <tt>ws_restore</tt> for a short grace period (typically 1 hour after release)
* '''Expired workspaces:''' Can be recovered for a longer period (system-dependent, typically 7-30 days after expiration, configured via <tt>keeptime</tt> in <tt>/etc/ws.conf</tt>)
* Final deletion typically happens during nighttime maintenance
* Final deletion happens automatically during cleanup runs (usually nighttime)
* '''Important:''' Released workspace data may still count toward your quota until final deletion occurs
* '''Important:''' Released workspace data may still count toward your quota until final deletion occurs


Line 653: Line 624:
'''Option 2: Manual deletion before release''' (works on all clusters):
'''Option 2: Manual deletion before release''' (works on all clusters):


See the section on [[#Using_rm_for_Manual_Deletion|using rm for manual deletion]] below for the safe method.
$ rm -rf -- "$(ws_find myWs)"
$ ws_release myWs

'''Why this is safe:'''
* <tt>--</tt> prevents path from being interpreted as option (even if it starts with <tt>-</tt>)
* Quotes around <tt>$(ws_find myWs)</tt> handle paths with spaces safely
* If <tt>ws_find</tt> fails or returns empty, <tt>rm</tt> fails with "missing operand" (no deletion occurs)
* Direct and simple - no complex patterns or subshells needed


'''CRITICAL WARNING:''' Both methods permanently delete data that '''cannot be recovered''', even by system administrators.
'''CRITICAL WARNING:''' Both methods permanently delete data that '''cannot be recovered''', even by system administrators.
Line 673: Line 637:
'''Best practice:''' Use regular <tt>ws_release myWs</tt> (without immediate deletion) as your default. Only use immediate deletion when you're absolutely certain and need immediate quota relief.
'''Best practice:''' Use regular <tt>ws_release myWs</tt> (without immediate deletion) as your default. Only use immediate deletion when you're absolutely certain and need immediate quota relief.


=== Using rm for Manual Deletion ===
== Restore an Expired Workspace ==

If <tt>ws_release --delete-data</tt> is not available on your cluster, you can manually delete data before releasing:

$ rm -rf -- "$(ws_find myWs)"
$ ws_release myWs

'''Why this is safe:'''
* <tt>--</tt> prevents path from being interpreted as option (even if it starts with <tt>-</tt>)
* Quotes around <tt>$(ws_find myWs)</tt> handle paths with spaces safely
* If <tt>ws_find</tt> fails or returns empty, <tt>rm</tt> fails with "missing operand" (no deletion occurs)
* Direct and simple - no complex patterns or subshells needed

== ws_restore - Restore Expired Workspace ==


{| class="wikitable"
{| class="wikitable"
Line 779: Line 756:
=== Cluster-Specific Information ===
=== Cluster-Specific Information ===


Retention periods for expired/released workspaces vary by cluster:
Grace periods for expired/released workspaces vary by cluster and are configured via the <tt>keeptime</tt> parameter in <tt>/etc/ws.conf</tt>:


{| class="wikitable"
{| class="wikitable"
|-
|-
!style="width:30%" | Cluster
!style="width:30%" | Cluster
!style="width:35%" | Expired Workspace Retention
!style="width:35%" | Expired Workspace Retention (<tt>keeptime</tt>)
!style="width:35%" | Released Workspace Retention
!style="width:35%" | Released Workspace Grace Period
|-
|-
|bwUniCluster 3.0
|bwUniCluster 3.0
|30 days
|30 days
|30 days
|~1 hour
|-
|-
|Helix
|Helix
|System-specific grace period
|System-specific (check with support)
|~1 hour
|System-specific grace period
|-
|-
|JUSTUS 2
|JUSTUS 2
|System-specific grace period
|System-specific (check with support)
|~1 hour
|System-specific grace period
|-
|-
|BinAC2
|BinAC2
|System-specific grace period
|System-specific (check with support)
|~1 hour
|System-specific grace period
|-
|-
|NEMO2
|NEMO2
|30 days
|30 days
|30 days
|~1 hour
|}
|}

'''Note:'''
* '''Expired workspaces''' (reached end of lifetime) are kept for <tt>keeptime</tt> days configured in <tt>/etc/ws.conf</tt>
* '''Released workspaces''' (via <tt>ws_release</tt>) are deleted faster - typically within 1 hour
* '''Actual deletion''' happens during automated cleanup runs, typically during nighttime
* This allows users to recover accidentally released workspaces quickly, while giving more time for expired workspaces


== Cluster-Specific Workspace Limits ==
== Cluster-Specific Workspace Limits ==


Different clusters have different workspace policies. Below is an overview of typical settings:
Different clusters have different workspace policies configured in <tt>/etc/ws.conf</tt>. Below is an overview of typical settings:


{| class="wikitable"
{| class="wikitable"
Line 857: Line 840:
|}
|}


'''Configuration parameters in <tt>/etc/ws.conf</tt>:'''
'''Note:''' Check your specific cluster documentation for current quotas, as they may change. Use <tt>ws_list -l</tt> to see available filesystems and their properties on your cluster.
* <tt>duration</tt> - Maximum lifetime in days
* <tt>maxextensions</tt> - Number of times a workspace can be extended
* <tt>keeptime</tt> - Days to keep expired workspaces before final deletion (typically 7-30 days)
* Released workspaces have a shorter grace period (~1 hour) regardless of <tt>keeptime</tt>

'''Note:''' Check your specific cluster documentation for current quotas and settings, as they may change. Use <tt>ws_list -l</tt> to see available filesystems and their properties on your cluster.


== Checking Workspace Quotas ==
== Checking Workspace Quotas ==
Line 883: Line 872:
'''General tip:''' Always check disk usage before large data operations to ensure sufficient space is available.
'''General tip:''' Always check disk usage before large data operations to ensure sufficient space is available.


== Register Workspace Links ==
== ws_register - Register Workspace Links ==


The <tt>ws_register</tt> command creates or updates symbolic links to your workspaces in a directory of your choice. This provides a convenient way to access all your workspaces from a single location.
The <tt>ws_register</tt> command creates or updates symbolic links to your workspaces in a directory of your choice. This provides a convenient way to access all your workspaces from a single location.
Line 893: Line 882:
$ ws_register ~/workspaces
$ ws_register ~/workspaces


his will create symbolic links to all your workspaces in the <tt>~/workspaces</tt> directory. The command creates subdirectories for each filesystem (e.g., <tt>classic/</tt>) and places workspace links inside them.
This will create symbolic links to all your workspaces in the <tt>~/workspaces</tt> directory. The command creates subdirectories for each filesystem (e.g., <tt>scratch/</tt>) and places workspace links inside them.


Example:
Example:

Revision as of 15:38, 21 November 2025

This document covers advanced features and detailed configuration options for the HPC workspace tools. For basic daily usage, see the main Workspaces guide.

Almost Complete Command Reference

Task Command
Create workspace for 30 days ws_allocate myWs 30
Create with custom email ws_allocate -m custom@example.com -r 3 myWs 30
Create group-writable workspace ws_allocate -G groupname myWs 30
Create on specific filesystem ws_allocate -F filesystem myWs 30
List all your workspaces ws_list
List by remaining time ws_list -R
List available filesystems ws_list -l or ws_find -l
Find workspace path ws_find myWs
Extend workspace by 40 days ws_extend myWs 40 or ws_allocate -x myWs 40
Share with another user ws_share share myWs username
List shared users ws_share list myWs
Send calendar reminder ws_send_ical myWs user@example.com
Release workspace ws_release myWs
List restorable workspaces ws_restore -l
Register workspace links ws_register ~/workspaces

Multiple Filesystem Locations

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
-F option (multiple filesystems)

Some clusters offer multiple filesystem locations for workspaces with different characteristics:

bwUniCluster 3.0:

  • Default workspace filesystem (Lustre)
  • Flash-based workspace filesystem (ffuc) - for KIT/HoreKa users only
    • Lower latency and better performance for small files
    • SSDs instead of hard disks
    • Shared between bwUniCluster 3.0 and HoreKa

Example creating workspace on flash filesystem:

  $ ws_allocate -F ffuc myworkspace 60

Use ws_list -l or ws_find -l to see available filesystem locations on your cluster.

ws_allocate - Create and Extend Workspaces

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
ws_allocate -x -u (extend other user's workspace)

Basic Usage

Execution of:

  $ ws_allocate myWs 30

e.g. returns:

  Workspace created. Duration is 720 hours. 
  Further extensions available: 3
  /work/workspace/scratch/username-myWs-0

The command returns the path to the new directory, which can be captured in a variable:

  $ SCRDIR=$(ws_allocate myWs 10)

Important: Creating a workspace a second time with the same command is a no-operation - it always returns the same path. This makes it safe and encouraged to use such a line in batch jobs which are part of a series of jobs working on the same data, no matter if the job was running before or not.

All Options and When to Use Them

Option Description When to Use
-F <filesystem> Specify the filesystem where the workspace should be created Optional - Most clusters have only one default filesystem. Use only when you need specific storage characteristics (speed, capacity) or to balance load across multiple filesystems. List available locations with ws_list -l or ws_find -l
-g Create a group-readable workspace Recommended when working in a team and others need to read your data. The workspace will be visible to group members with ws_list -g
-G <groupname> Create a group-writable workspace with sticky bit Recommended for collaborative work where team members need to write data. Ensures all created files belong to the group. Can be set as default in ~/.ws_user.conf
-m <mailaddress> Set email address for reminders Optional - Email addresses come from your identity provider. Only use this option to override with a different address. Can be set as default in ~/.ws_user.conf
-r <days> Set reminder to be sent n days before expiration Optional - Email reminders are sent automatically. Use this only to customize when the reminder starts (e.g., 3, 5, or 7 days before). Can be set as default in ~/.ws_user.conf
-x Extend an existing workspace Use when you need more time. Each extension consumes one of the available extensions
-u <username> Used with -x to extend another user's workspace Use when a group member is absent and their shared workspace needs extension. Requires group write access -G
-c <comment> Add a comment to the workspace Use to document the purpose of the workspace for yourself and collaborators
-d <duration> Duration in days (alternative to positional argument) Use if you prefer explicit option syntax: ws_allocate -n myWs -d 30
-n <name> Workspace name (alternative to positional argument) Use if you prefer explicit option syntax: ws_allocate -n myWs -d 30

Duration Settings

If you do not specify a lifetime, a default lifetime will be used (see the Cluster-Specific Workspace Limits)). The maximum lifetime may be limited by the operations team. If you specify a longer lifetime, it will be capped to the maximum, and you will see a message that it was changed.

For more information read the program's help: $ ws_allocate -h or man ws_allocate

ws_list - List Workspaces

Beyond the basic options shown in the main Workspaces guide, ws_list supports additional sorting and filtering:

  • -N - Sort by name (alphabetical)
  • -C - Sort by creation date
  • -t - Terse format
  • -v - Verbose format with all metadata
  • -r - Reverse sort order
  • -F <location> - List workspaces on specific filesystem only
  • -g - List group workspaces (if you're in the same group)
  • -l - List available filesystem locations

Example listing group workspaces:

  $ ws_list -g

This shows all workspaces that were created with -g or -G flags by members of your group.

Note: To list expired workspaces that can be restored, use ws_restore -l. See ws_restore for details.

For more information: $ ws_list -h or man ws_list

ws_find - Find Workspace Path

ws_find can search workspaces on specific filesystems:

  • -F <filesystem> - Search workspace on specific filesystem
  • -l - List valid filesystem names

ws_extend - Extend Workspace Lifetime

Beyond the basic extension shown in the main Workspaces guide:

  • -F filesystem - Extend workspace on specific filesystem
  • ws_allocate -x - Alternative command for extension
  • You can shorten workspace lifetime even if no extensions are available
  • Group members can extend group workspaces. Requires group write access ws_allocate -G:
  ws_allocate -x -u <username> <workspace_id> <days>

Update reminder only (without extending):

  $ ws_allocate -r <days> -x <workspace> 0                 # Update reminder time
  $ ws_allocate -r <days> -u <username> -x <workspace> 0   # Update reminder time and take over
                                                           # another user's workspace reminders

Getting Reminders

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
Email reminders
ws_send_ical (calendar reminders)

Email reminders: Sent automatically using email addresses from your identity provider. You can customize the reminder timing with -r <days>:

  $ ws_allocate -r 7 myWs 30                          # Reminder 7 days before expiry
  $ ws_allocate -r 3 -m custom@example.com myWs 30    # Custom timing and different email address

Calendar reminder (bwUniCluster 3.0, Helix):

  $ ws_send_ical <workspace> <email>

~/.ws_user.conf Configuration File

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
~/.ws_user.conf configuration file

You can set defaults in ~/.ws_user.conf (YAML format) to avoid typing the same options repeatedly:

duration: 30              # Default workspace lifetime (first line must not start with #!)
reminder: 5               # Days before expiration to send reminder
groupname: projectgroup   # Default group for -G option (e.g., bw11a000)
# mail: custom@example.com  # Optional - only to override email from identity provider

Important: Some versions mistakenly interpret a leading # comment as email. First line must be a setting, not a comment. Inline comments are fine.

Benefits:

  • Simplifies commands: ws_allocate myWs instead of ws_allocate -r 5 -G groupname myWs 30
  • Ensures consistent settings across all operations
  • Automatic group collaboration when groupname is set

Cooperative Usage (Group Workspaces and Sharing)

When working in teams, workspaces can be shared in multiple ways.

WARNING: NEVER use chmod 777 or a+rwx on workspaces!

Do NOT make your workspace readable or writable by everyone (chmod 777, chmod a+rwx, or chmod o+rwx). This is a severe security risk:

  • Anyone on the system can read, modify, or delete your data
  • Malicious users can inject code into your workspace
  • Your data and results become unreliable
  • You violate security policies and may lose access privileges

Always use proper sharing methods: Use -g/-G flags, ws_share, or group-based permissions instead.

Important: Not all sharing methods are available on all clusters. The availability depends on:

  • Filesystem type and ACL support
  • Cluster-specific workspace tool configuration
  • Unix group setup and permissions

If one sharing method doesn't work on your cluster, try an alternative approach. The -g and -G flags are most widely supported.

Group Workspaces

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
-g option (group-readable)
-G option (group-writable)

When a workspace is created with -g it becomes a group workspace that is visible to others with ws_list -g (if in same group), and is group readable:

  $ ws_allocate -g myWs 30

When created with -G <groupname> the workspace becomes writable as well, and gets group sticky bit:

  $ ws_allocate -G projectgroup myWs 30

The group can be specified in the ~/.ws_user.conf file as well.

Important: Group members can extend group-writable workspaces (created with -G) even if the original creator is absent:

  $ ws_allocate -x -u <username> <workspace_id> <days>

This requires group write access to the workspace. This is useful when the workspace owner is unavailable and the workspace needs to be extended.

Recommendations:

  • Use -g when team members only need to read your results
  • Use -G for collaborative work where everyone writes data
  • Set groupname in ~/.ws_user.conf if you always work with the same group

Sharing with ws_share

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
ws_share command (ACL-based)

With ws_share you can share workspaces with users outside your group, using ACLs (if supported by underlying filesystem).

Note: This feature requires ACL support on the filesystem. If ws_share doesn't work on your cluster, use manual ACL commands (setfacl) or fall back to Unix group permissions.

Share workspace with users:

  $ ws_share share myWs username1 username2    # Grant read access to one or more users
  $ ws_share share -F filesystem myWs user1    # Share on specific filesystem

Unshare workspace from users:

  $ ws_share unshare myWs username1            # Remove access from specific user(s)
  $ ws_share unshare-all myWs                  # Remove access from all users

List users with access:

  $ ws_share list myWs                         # Show all users with read access

These operations are applied to all files and directories in the workspace.

Options:

  • -F <filesystem>, --filesystem: Specify the workspace filesystem
  • -h, --help: Show help message

Recommendation: Use ws_share for selective sharing with individual users, especially when they are not in your Unix group.

ACLs: Access Control Lists

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
setfacl/getfacl (ACLs)

ACLs (Access Control Lists) provide fine-grained permission control beyond standard Unix permissions. They allow sharing with specific users and groups, and support default ACLs that new files automatically inherit.

Note: ACL support varies by filesystem. Not all clusters support ACLs on workspace filesystems. If ACL commands fail, use regular Unix permissions instead.

Key advantages:

  • Share with specific users (not just groups)
  • Default ACLs - new files automatically inherit permissions
  • More flexible than Unix permissions

Note: ACLs take precedence over standard Unix permissions. View ACLs with ls -l (shown as "+" after permissions).

Quick Examples

Set workspace path in variable:

  $ DIR=$(ws_find my_workspace)

View current ACLs:

  $ getfacl "$DIR"

Important note on syntax: In all commands below, user: and group: are setfacl keywords. Replace username with the actual user login name (e.g., alice, jdoe) and groupname with the actual Unix group name (e.g., bw11a000).

Grant read-only access to a user:

  $ setfacl -Rm user:username:rX,default:user:username:rX "$DIR"
  
  # Example with actual username:
  $ setfacl -Rm user:alice:rX,default:user:alice:rX "$DIR"

Grant read-write access to a user:

  $ setfacl -Rm user:username:rwX,default:user:username:rwX "$DIR"
  
  # Example with actual username:
  $ setfacl -Rm user:jdoe:rwX,default:user:jdoe:rwX "$DIR"

Grant read-only access to a group:

  $ setfacl -Rm group:groupname:rX,default:group:groupname:rX "$DIR"
  
  # Example with actual groupname:
  $ setfacl -Rm group:bw11a000:rX,default:group:bw11a000:rX "$DIR"

Remove all ACLs:

  $ setfacl -Rb "$DIR"

Key Options

  • -R: Apply to all files and subdirectories
  • -m: Modify (add or change ACL entries)
  • -b: Remove all ACL entries
  • user:username:rwX: Set permissions for specific user (replace username with actual login)
  • group:groupname:rwX: Set permissions for specific group (replace groupname with actual group)
  • default: prefix: New files inherit these ACLs automatically
  • r: Read permission
  • w: Write permission
  • X: Execute only on directories and already-executable files (capital X)

Important: Always use the default: prefix to ensure new files get the correct permissions automatically.

Recommendation

Always prefer ws_allocate -G or ws_share first. Use manual ACLs only for complex scenarios like:

  • Sharing with specific users outside your group
  • Different permissions for different users
  • Fine-grained control not possible with -G or ws_share

Regular Unix Permissions

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
chmod/chgrp (Unix permissions)

Use standard Unix permissions with chmod and chgrp when you and your collaborators share a common Unix group.

CRITICAL WARNING:

  • NEVER use chmod 777 or a+rwx - makes your data accessible to everyone on the system
  • NEVER use chmod o+rwx or chmod o+w - allows anyone to modify or delete your files
  • Only set group permissions (g+r, g+w) for your specific research group

Quick Examples

Set workspace path in variable:

  $ DIR=$(ws_find my_workspace)

Read-only access for group:

  $ chgrp -R mygroup "$DIR"
  $ chmod -R g+rX "$DIR"

Read-write access for group:

  $ chgrp -R mygroup "$DIR"
  $ chmod -R g+rswX "$DIR"

Key Options

  • -R: Apply to all files and subdirectories
  • g+r: Group read permission
  • g+w: Group write permission
  • g+x: Group execute permission
  • X: Execute only on directories and already-executable files (capital X)
  • s: Setgid bit (set-group-ID) - new files inherit the directory's group ownership

Important: The setgid bit (g+s) ensures new files belong to the correct group, but their permissions depend on your umask. With the default umask (0022), new files will NOT be group-writable automatically. You must either:

  • Set umask 0002 in your shell so new files are group-writable by default, OR
  • Manually run chmod g+w on new files, OR
  • Use ACLs with default: entries (which override umask and handle permissions automatically)

Recommendation

Always prefer ws_allocate -G groupname over manual Unix permissions. It handles everything automatically and correctly, including the sticky bit and proper permissions on all new files.

Use manual chmod/chgrp only when -G is not available on your cluster or for fixing permissions on existing data.

ws_release - Release (Delete) Workspace

To release a workspace, use:

  $ ws_release myWs

Syntax: ws_release [options] workspace_name

What happens when you release:

  • The workspace ID can be reused immediately
  • The directory becomes inaccessible to the user
  • The data is not deleted immediately - it is moved to a deleted area
  • Released workspaces: Can be recovered using ws_restore for a short grace period (typically 1 hour after release)
  • Expired workspaces: Can be recovered for a longer period (system-dependent, typically 7-30 days after expiration, configured via keeptime in /etc/ws.conf)
  • Final deletion happens automatically during cleanup runs (usually nighttime)
  • Important: Released workspace data may still count toward your quota until final deletion occurs

Options

Option Description
-n <name> Workspace name (alternative to positional argument)
-F <filesystem> Specify filesystem where the workspace is located
--delete-data Delete all data immediately. WARNING: Workspace can NOT BE RECOVERED

Immediate Deletion (Free Quota Instantly)

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
ws_release --delete-data (immediate deletion)

If you need to free quota immediately instead of waiting for the grace period, you have two options:

Option 1: Use --delete-data flag (recommended if available on your cluster):

  $ ws_release --delete-data myWs

Availability: Check if this flag is available on your cluster with ws_release -h.

Option 2: Manual deletion before release (works on all clusters):

See the section on using rm for manual deletion below for the safe method.

CRITICAL WARNING: Both methods permanently delete data that cannot be recovered, even by system administrators.

When to use immediate deletion:

  • ✓ You're certain you don't need the data anymore
  • ✓ You've already backed up important results to permanent storage
  • ✓ You're hitting quota limits and need space immediately
  • ✗ Don't use if there's any chance you might need the data
  • ✗ Don't use if you haven't verified your backup worked

Best practice: Use regular ws_release myWs (without immediate deletion) as your default. Only use immediate deletion when you're absolutely certain and need immediate quota relief.

Using rm for Manual Deletion

If ws_release --delete-data is not available on your cluster, you can manually delete data before releasing:

  $ rm -rf -- "$(ws_find myWs)"
  $ ws_release myWs

Why this is safe:

  • -- prevents path from being interpreted as option (even if it starts with -)
  • Quotes around $(ws_find myWs) handle paths with spaces safely
  • If ws_find fails or returns empty, rm fails with "missing operand" (no deletion occurs)
  • Direct and simple - no complex patterns or subshells needed

ws_restore - Restore Expired Workspace

Works on cluster bwUC 3.0 BinAC2 Helix JUSTUS 2 NEMO2
ws_restore

For a certain (system-specific) grace time following workspace expiration, a workspace can be restored.

Syntax: ws_restore [options] workspace_name target_name | -l

Restore Procedure

(1) Display restorable workspaces:

  $ ws_restore -l

This will list all workspaces that can still be recovered. You can use -b or --brief to hide the unavailability date in the list.

(2) Create a new workspace as the target for the restore:

  $ ws_allocate restored 60

(3) Restore the expired workspace:

  $ ws_restore <full_name_of_expired_workspace> restored

Important:

  • The expired workspace must be specified using the full name as printed by ws_restore -l (not as printed by ws_list!), including username prefix and timestamp suffix (otherwise it cannot be uniquely identified)
  • The target workspace must be given with just its short name as listed by ws_list, without the username prefix
  • ws_restore can only work on the same filesystem. Ensure the new workspace is on the same filesystem as the expired workspace. Use -F <filesystem> flag if needed

Example:

  $ ws_restore username-myWs-0 restored

Options

Option Description
-l, --list List restorable workspaces. Use this first to see what can be recovered
-b, --brief Do not show unavailability date in list (used with -l)
-n <name> Workspace name (alternative to positional argument)
-t <target> Existing target workspace name (alternative to positional argument)
-F <filesystem> Specify filesystem where the workspace is located
-u <username> Username (for restoring other users' workspaces if permitted)

If Workspace Cannot Be Restored

If the workspace is not visible/restorable, it has been permanently deleted and cannot be restored, not even by system administrators.

Helix-specific - Workspace Snapshots:

On Helix, if the workspace can't be restored anymore using ws_restore, you can check the snapshots under:

  /work/.snapshots/<timepoint>/ws/

Important notes about snapshots:

  • Snapshots are point-in-time copies of the workspace filesystem
  • Changes that happened since the last snapshot was created are lost
  • Browse available snapshot timepoints in /work/.snapshots/
  • Caution: The Helix team tries to keep the latest snapshots, but they cannot guarantee that snapshots will be available at all times
  • Snapshots are a last resort when ws_restore no longer works

How to use snapshots:

  $ ls /work/.snapshots/                    # List available snapshot timepoints
  $ cd /work/.snapshots/2025-11-15_00.00.00/ws/
  $ ls                                      # Find your old workspace directory
  $ cp -r username-myWs-0 /path/to/active/workspace/  # Copy data to active workspace

Contact Helix support if you need assistance with snapshot recovery.

Please remember: Workspaces are intended solely for temporary work data, and there is no backup of data in the workspaces.

Recommendation: Email reminders are sent automatically. You can optionally customize reminder timing with -r option to be notified earlier, giving you time to extend the workspace or backup important data to appropriate permanent storage.

Cluster-Specific Information

Grace periods for expired/released workspaces vary by cluster and are configured via the keeptime parameter in /etc/ws.conf:

Cluster Expired Workspace Retention (keeptime) Released Workspace Grace Period
bwUniCluster 3.0 30 days ~1 hour
Helix System-specific (check with support) ~1 hour
JUSTUS 2 System-specific (check with support) ~1 hour
BinAC2 System-specific (check with support) ~1 hour
NEMO2 30 days ~1 hour

Note:

  • Expired workspaces (reached end of lifetime) are kept for keeptime days configured in /etc/ws.conf
  • Released workspaces (via ws_release) are deleted faster - typically within 1 hour
  • Actual deletion happens during automated cleanup runs, typically during nighttime
  • This allows users to recover accidentally released workspaces quickly, while giving more time for expired workspaces

Cluster-Specific Workspace Limits

Different clusters have different workspace policies configured in /etc/ws.conf. Below is an overview of typical settings:

Cluster Default Lifetime Max Lifetime Max Extensions User Quota Inode Quota
bwUniCluster 3.0 1 day 60 days 3 times 40 TiB 20 million
JUSTUS 2 7 days 90 days Unlimited 20 TiB 5 million
Helix N/A 30 days 10 times 10 TiB None
BinAC2 N/A 30 days 5 times None None
NEMO2 30 days 100 days 100 times 5 TiB per workspace None

Configuration parameters in /etc/ws.conf:

  • duration - Maximum lifetime in days
  • maxextensions - Number of times a workspace can be extended
  • keeptime - Days to keep expired workspaces before final deletion (typically 7-30 days)
  • Released workspaces have a shorter grace period (~1 hour) regardless of keeptime

Note: Check your specific cluster documentation for current quotas and settings, as they may change. Use ws_list -l to see available filesystems and their properties on your cluster.

Checking Workspace Quotas

The command to check workspace quota usage varies by cluster and filesystem:

Lustre-based clusters (bwUniCluster 3.0, JUSTUS 2):

  $ lfs quota -uh $(whoami) /lustre/work    # or appropriate workspace path
  $ lfs quota -uh $(whoami) /pfs/work9      # bwUniCluster 3.0

NEMO2 (Weka filesystem):

  $ nemoquota                                # Shows HOME and workspace quotas
  $ df --si $(ws_find workspace_name)        # Check specific workspace

Helix (IBM Spectrum Scale):

  $ workquotainfo                            # Shows workspace quota info

BinAC2:

  • No quota limits enforced on workspaces
  • Check available space with df command

General tip: Always check disk usage before large data operations to ensure sufficient space is available.

ws_register - Register Workspace Links

The ws_register command creates or updates symbolic links to your workspaces in a directory of your choice. This provides a convenient way to access all your workspaces from a single location.

Syntax: ws_register [-h] [--version] [-F FILESYSTEM] directory

Usage:

  $ ws_register ~/workspaces

This will create symbolic links to all your workspaces in the ~/workspaces directory. The command creates subdirectories for each filesystem (e.g., scratch/) and places workspace links inside them.

Example:

  $ mkdir -p ~/my_workspaces
  $ ws_register ~/my_workspaces
  $ ls -l ~/my_workspaces/scratch/
  lrwxrwxrwx 1 user group 45 Nov 17 10:30 user-myWs -> /work/workspace/scratch/user-myWs-0
  lrwxrwxrwx 1 user group 48 Nov 17 10:30 user-project1 -> /work/workspace/scratch/user-project1-0

Note: Links use the full workspace names (including username prefix), organized by filesystem subdirectory.

Options

Option Description
directory Directory in which links shall be created/updated (required positional argument)
-F <filesystem>, --filesystem <filesystem> Filesystem to search workspaces in. Only create links for workspaces on this filesystem
-h, --help Show help message
--version Show program's version number and exit

When to use:

  • Recommended if you work with multiple workspaces and want quick access
  • Use in your ~/.bashrc or login scripts to automatically update links at login
  • Useful for organizing workspaces by project or purpose

Example in login script:

# In ~/.bashrc or ~/.profile
# Only run in interactive shells to avoid issues with scp, rsync, etc.
if [ -t 0 ] && [ -z "$SSH_ORIGINAL_COMMAND" ] && command -v ws_register >/dev/null 2>&1; then
    mkdir -p ~/workspaces >/dev/null 2>&1
    ws_register ~/workspaces >/dev/null 2>&1
fi