Bobcares

GSUTIL RM Wildcard | An Introduction

by | Feb 12, 2024

The gsutil rm command, combined with wildcards, deletes Cloud Storage objects whose names match a wildcard pattern. As part of our Google Cloud Platform Support Service, Bobcares provides answers to all of your questions.

GSUTIL Wildcard RM

The gsutil rm command removes objects, and removing an object cannot be undone. To safeguard ourselves against mistakes, we can set up object versioning on the bucket(s). Beyond that, Google Cloud Storage gives developers considerable flexibility and control over their data, and Google strictly regulates how deleted data is handled and removed. Let’s look into the details:

1. The command gsutil rm gs://bucket/subdir/* deletes all objects in gs://bucket/subdir, but not in any of its subdirectories.

2. The command gsutil rm gs://bucket/subdir/** deletes all objects under gs://bucket/subdir, including those in any of its subdirectories.

3. Either of the following commands removes gs://bucket/subdir and all objects and subdirectories under it:

gsutil rm gs://bucket/subdir**
gsutil rm -R gs://bucket/subdir

For versioning-enabled buckets, the ** command will only remove the live version of each item in the subdirectory; in contrast, the -R option will also remove all object versions in the subdirectory.
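The difference between * and ** described in items 1 to 3 can be tried out without touching a real bucket. The sketch below is only an emulation built on plain shell pattern matching, not gsutil's own matching code. The object names and the helper functions match_single and match_recursive are hypothetical and exist purely for illustration.

```shell
# Hypothetical object names in gs://bucket, for illustration only.
objects="subdir/a.txt
subdir/b.txt
subdir/nested/c.txt
other/d.txt"

# Emulates gs://bucket/subdir/* : direct children of subdir/ only.
match_single() {
  printf '%s\n' "$objects" | while IFS= read -r name; do
    case "$name" in
      subdir/*/*) ;;                          # deeper than one level: skipped
      subdir/*)   printf '%s\n' "$name" ;;
    esac
  done
}

# Emulates gs://bucket/subdir/** : everything under subdir/, at any depth.
match_recursive() {
  printf '%s\n' "$objects" | while IFS= read -r name; do
    case "$name" in
      subdir/*) printf '%s\n' "$name" ;;
    esac
  done
}

echo "subdir/* would match:"
match_single
echo "subdir/** would match:"
match_recursive
```

Listing the matches first is a cheap safety habit before any real deletion. Against an actual bucket, running gsutil ls with the same URL pattern serves the same purpose.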

4. Running gsutil rm -R on a bucket, as in gsutil rm -R gs://bucket, deletes all versions of all objects in the bucket and then deletes the bucket itself.

5. We can use the command gsutil rm gs://bucket/** to delete all objects in the bucket, but not the bucket itself.

6. Use the gsutil -m option to execute a parallel (multi-threaded/multi-processing) remove if we have a lot of objects to get rid of: gsutil -m rm -R gs://my_bucket/subdir.

7. Commands like gsutil rm *.txt will not work, because gsutil rm only removes cloud objects and won’t delete files from the local file system.

8. To delete objects with gsutil following a wildcard pattern at several levels of the path, we can try gsutil rm -r gs://bucket/folder/{WILDCARD-A}/folderA/{WILDCARD-B}/folderB, replacing each placeholder with the wildcard pattern we need.

As noted in item 6, the gsutil -m option enables multi-threading/multi-processing when we have a lot of objects to remove. Even so, avoid using gsutil to mass-delete 100,000 or more objects, because it takes a long time to finish. Instead, we can use Object Lifecycle Management, which can delete any number of objects, or the Google Cloud console, which can delete up to several million objects.
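For the Object Lifecycle Management route, a minimal sketch might look like the following. The bucket name, the subdir/ prefix, and the 30-day age are assumptions chosen for illustration, not values from this article. The gsutil lifecycle set command is printed rather than executed, so the snippet is safe to run as-is.

```shell
# Hypothetical values: the bucket name, prefix, and age are placeholders.
BUCKET="gs://my_bucket"
CONFIG_FILE="$(mktemp)"

# Lifecycle rule: delete objects under subdir/ once they are older than 30 days.
cat > "$CONFIG_FILE" <<'EOF'
{
  "rule": [
    {
      "action": {"type": "Delete"},
      "condition": {"age": 30, "matchesPrefix": ["subdir/"]}
    }
  ]
}
EOF

# Printed instead of executed; run the printed command manually to apply it.
echo "gsutil lifecycle set $CONFIG_FILE $BUCKET"
```

Lifecycle rules are applied by Cloud Storage asynchronously, so objects are not removed the instant the configuration is set. Review the rule carefully before applying it, since it affects every matching object in the bucket.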

9. When using wildcards in gsutil to delete a specific subdirectory, we must enclose the wildcards in single or double quotes. Otherwise, zsh tries to expand the wildcard itself, and complains that no local files match it, before gsutil ever sees it. To stop zsh from doing so, quote the URL:
gsutil rm 'gs://bucket/**'
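The effect of quoting described in items 7 and 9 can be seen with a small local experiment. In this sketch, echo stands in for gsutil, and the temporary directory and the file name notes.txt are hypothetical. Note that shells differ on unmatched patterns: zsh reports an error by default, while other shells typically pass the pattern through unchanged.

```shell
# Work in a throwaway directory so the demo touches no real files.
demo_dir="$(mktemp -d)"
touch "$demo_dir/notes.txt"

# Unquoted: the shell replaces the pattern with matching local file names
# before the command (echo here, standing in for gsutil) ever sees it.
unquoted="$(cd "$demo_dir" && echo *.txt)"

# Quoted: the pattern reaches the command unchanged, so a tool such as
# gsutil could interpret the wildcard itself.
quoted="$(cd "$demo_dir" && echo '*.txt')"

printf 'unquoted argument: %s\n' "$unquoted"
printf 'quoted argument:   %s\n' "$quoted"

# Clean up the throwaway directory.
rm -rf "$demo_dir"
```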


Conclusion

To conclude, this article explains in detail how to use the gsutil rm command with wildcards.
