How can I use pathlib to recursively iterate over all subdirectories of a given directory?
p = Path('docs')
for child in p.iterdir(): child
only seems to iterate over the immediate children of a given directory.
I know this is possible with os.walk() or glob, but I want to use pathlib because I like working with the path objects.
Answers:
Thank you for visiting the Q&A section on Magenaut. Please note that all the answers may not help you solve the issue immediately. So please treat them as advisements. If you found the post helpful (or not), leave a comment & I’ll get back to you as soon as possible.
Method 1
Use Path.rglob (substitutes the leading ** in Path().glob("**/*")):
path = Path("docs")
for p in path.rglob("*"):
print(p.name)
Method 2
You can use the glob method of a Path object:
p = Path('docs')
for i in p.glob('**/*'):
print(i.name)
Method 3
To find just folders the right glob string is:
'**/'
So to find all the paths for all the folders in your path do this:
p = Path('docs')
for child in p.glob('**/'):
print(child)
If you just want the folder names without the paths then print the name of the folder like so:
p = Path('docs')
for child in p.glob('**/'):
print(child.name)
Method 4
pathlib has glob method where we can provide pattern as an argument.
For example : Path('abc').glob('**/*.txt') – It will look for current folder abc and all other subdirectories recursively to locate all txt files.
Method 5
Use list comprehensions:
(1) [f.name for f in p.glob("**/*")] # or
(2) [f.name for f in p.rglob("*")]
You can add if f.is_file() or if f.is_dir() to (1) or (2) if you want to target files only or directories only, respectively. Or replace "*" with some pattern like "*.txt" if you want to target .txt files only.
See this quick guide.
All methods was sourced from stackoverflow.com or stackexchange.com, is licensed under cc by-sa 2.5, cc by-sa 3.0 and cc by-sa 4.0