Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added untar progress similar to existing unzip #17519

Open
wants to merge 4 commits into
base: develop2
Choose a base branch
from

Conversation

perseoGI
Copy link
Contributor

Changelog: Feature: Add report progress while unpacking tarball files
Docs: omit

Closes #14589

  • Refer to the issue that supports this Pull Request.
  • If the issue has missing info, explain the purpose/use case/pain/need that covers this Pull Request.
  • I've read the Contributing guide.
  • I've followed the PEP8 style guides for Python code.
  • I've opened another PR in the Conan docs repo to the develop branch, documenting this one.

Copy link
Member

@memsharded memsharded left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are different pieces in this PR.
For the untargz it seems the same approach as used in the FileUploader makes sense, but reusing same code, prepared for later refactor.

For the zip uncompress, if it can use the same class FileProgress, then great, if not, maybe leave that untouched.

# NOT EXPOSED at `conan.tools.files` but used in tests
import tarfile
with tarfile.TarFile.open(filename, 'r:*') as tarredgzippedFile:
import io
class ProgressFileObject(io.FileIO):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We already have a class FileProgress in the FileUploader, maybe it is the time to extract it and reuse it. Or at least copy it as-is, and leave it for a later refactor. But it would be that at least the initial implementation is the same.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've extracted that FileProgress class and adapted it to different scenarios! See https://github.com/conan-io/conan/pull/17519/files#diff-9afbf22c7cefd3f7b642e9b1b988ba829d3c36e3092bfb6140d01ad0bba4c571R537

if current_percentage >= self.percentage + self.step:
if 100 - current_percentage == self.step:
current_percentage = 100
self.output.rewrite_line(txt_msg % current_percentage)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rewrites are not that great, specially for CI output, which result in endless and annoying logs if enabled, and not a trace at all if not enabled. The "timed-output" seems to be a good balance.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We were already using output.rewrite_line for the unzip progress but I agree it could cause some inconvenience.
With this refactor, we could remove the rewrite by using FileProgress class.

Copy link
Member

@memsharded memsharded left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FileProgress wrapping the full file object, including the wrapping with open/close context is a bit risky, as the file descriptor will not be properly closed. This is one of the reasons why the previous wrapper was not inheriting from io.FileIO and wrapping the minimum necessary for the progress.

@@ -549,3 +532,17 @@ def move_folder_contents(conanfile, src_folder, dst_folder):
os.rmdir(src_folder)
except OSError:
pass


class FileProgress(io.FileIO):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This file is not the right place for this class, this file should be intended only for recipe importable from conan.tools. Things from this module shouldn't be used by Conan internals, and this needs to be imported from the internal "upload" code.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okey, totally right, I've moved back to previous conans location. Consider moving in the future to conan.internal

# NOT EXPOSED at `conan.tools.files` but used in tests
import tarfile
with tarfile.TarFile.open(filename, 'r:*') as tarredgzippedFile:
with tarfile.TarFile.open(fileobj=FileProgress(filename, msg="Uncompressing"), mode='r:*') as tarredgzippedFile:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This fileob is not being closed correctly, as it is not called with with FileProgres. It would be better if this file is being correctly closed

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using now with statement to manage both file and resource.

pass

with zipfile.ZipFile(filename, "r") as z:
with zipfile.ZipFile(FileProgress(filename, msg="Unzipping", mode="r")) as z:
Copy link
Member

@memsharded memsharded Dec 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as above, file not being closed

@memsharded memsharded self-assigned this Dec 27, 2024
@perseoGI
Copy link
Contributor Author

FileProgress wrapping the full file object, including the wrapping with open/close context is a bit risky, as the file descriptor will not be properly closed. This is one of the reasons why the previous wrapper was not inheriting from io.FileIO and wrapping the minimum necessary for the progress.

I understand your concern. I tried the original FileProgress approach but wanted to do a more "general" FileProgress wrapper that could work in different scenarios, such as this one:
https://github.com/perseoGI/conan/blob/8fe92be6d52f5fa357fed25d83ddced6b28b8f15/conan/tools/files/files.py?plain=1#L356
The only way I found to get a tarfile progress without any impact in the decompression is to create a wrapper of the fileobj class.
Please look at the new changes which now use with statement simplifying the consumer side.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[feature] Report progress of unpacking source files
2 participants