-
-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not all buckets listed, remove mass appropriation buckets #94
Comments
@ltguillaume Bite me! I also have my own manifests in my bucket, the bots and other scripts are doing the rest... While it is true that some manifests/apps are outdated and the bot simply ignores them... it doesn't mean that i should delete my own bucket just because of that. (For other buckets, that you have listed alongside mine i cannot say anything) In simple terms... if you don't like it(my bucket), don't add it to the Scoop and don't use it, simple as that.... also you could use this and it handles any problems from the bucket -> https://github.com/winpax/sfsu |
@anderlli0053 Ah, I see the mistake I made here: I never meant to say that the buckets (the repos themselves) should be deleted: it's perfectly fine if you use it for your own personal benefit. What I meant to say is that they should be tagged as untrustworthy on scoop.sh or removed (read: unlisted) from there. This issue has been created within the context of scoop.sh, not of Scoop itself. I wasn't aware of sfsu and the Sprinkles library, thanks for the tip. |
Hello @ltguillaume, Your bucket is properly indexed, but according to the indexer logs, some of your manifests are malformed and cannot be parsed. Replacing the TAB control characters with spaces should resolve the issue, and all your manifests should appear within a few hours after this change.
Regarding the meta-buckets and the duplicate entries they produce, we have added a "Distinct manifests only" filter option (#58), which should address this issue. Once your manifest is indexed, it should be the only result when this option is selected. We previously discussed completely removing meta-buckets from the search results. However, as @anderlli0053 mentioned, there are legitimate uses for meta-buckets. Some may contain a mix of original and copied manifests, and certain users might prefer a single meta-bucket instead of multiple smaller ones. We also want to avoid maintaining a manual list of "Untrustworthy" buckets, as it may not be accurate and/or even misleading for the users. |
@gpailler Thank you so mucht for your complete response. I see some tabs had indeed snuck in since my last change. I'll make sure to run As for the
All this proves that the use of scoop.sh is completely unreliable by these mass appropriation buckets being indexed:
As such, it undermines the very reason Scoop even exists: keeping your software up-to-date. |
Hello @ltguillaume, To detect duplicates, used by the
Regarding your examples:
That said, I understand your concerns, but I don’t see anything particularly problematic here. https://scoop.sh is simply a search engine, and I trust users are capable of choosing the most up-to-date version or selecting from the most popular repository if they prefer a single bucket. I think https://scoop.sh offers a variety of sorting and filtering options to provides enough flexibility to meet users' needs and allow them to prioritize the results they’re looking for. |
Thanks again for the extensive reply. I'll get around to commenting on the specific examples, but for now, may I propose an improvement for the hashing method? Some manifests may have the same name and version, but their manifest works different from the others. This will be mentioned in the description. If you include the description in the calculation of the hash, this will make sure these distinct variants won't be discarded as being the same. As for repo popularity vs. up-to-dateness, I think it's pretty clear which one should win in the ordering. |
It's a fair point. I need to check if adding the description to the hash doesn't show too many real duplicates, but it's worth a try. |
My manifest does not use the portable launcher (ironic, since I created it), while keeping the profile inside The same goes for my
I think it may be a mistake to let "popularity" be a factor in this, especially when it may "win" against recency. The whole reason for Scoop's existence is to keep your software up-to-date, so any "popular" bucket that has months or years old manifests, while many new versions of the software were released, are de facto "violating" the very goal of the software (if the offering of old versions is unintended and undocumented).
The general rule doesn't apply in my sampling at all, which at least indicates that there is an issue. I propose to let go of the assumption of (3) and to address the issue by listing the manifests with the latest software versions first before applying a popularity contest to the filtering 😛
I really think it's unnecessarily messy to say the least. With the example of 28 results for the same software, no one will look at the second page. And considering you use the term "Best match" in your filters, it is highly likely that people choose a horribly outdated manifest and then trust their software is kept up-to-date. This can be improved significantly, though, by letting newer versions be leading, instead of the repo's popularity. |
Your suggestion to add descriptions in the hash to differentiate duplicates could indeed help in this situation. Out of curiosity, why not propose a PR to extras with your version? What are the advantages and disadvantages of both approaches?
When I search for an app, I tend to choose a well-known or popular bucket, even if the version is slightly outdated. I use Scoop to maintain and update my applications easily, and stability is an important factor. A popular bucket is more likely to be maintained over time than a random one. I will implement the mentioned changes, and we’ll see if it produces more accurate results. But please keep in mind that with over 170,000 manifests indexed, it’s challenging to always deliver precisely the results each user is looking for—this isn't Google 😛 |
I did try to, but there's a serious catch: with every update via Scoop, Firefox/Thunderbird and derivatives create a new profile in I added the workaround that starts the Profile Manager after the update is complete, so the user just needs to press Enter and the Scoop profile is set as default again, without the creation of another. This isn't pretty, but when used to it entirely acceptable.
I don't think I can follow that reasoning.
A popular bucket like Apart from that, it's probably easier to spot problems with smaller buckets than with those massive ones. Come to think of it, it may be a great idea to add a field to the manifest that allows the author to specify the "source bucket" of the manifest.
That's great, thank you!
Well, I don't think my argument here is based on personal taste (like Google's personalized search results). It's really about how to guarantee that working with 3rd party buckets is more reliable for everyone. |
Example: searching for
betterbird-nl
shows 3 results, all of them (in some cases often outdated) copies of the original manifest from my bucket, which is not shown. It also doesn't showbetterbird-nl
, but onlybetterbird-future-nl
.The same holds for more of the original manifests from my bucket ltguillaume/schep.
This can cause significant problems for end users, who get outdated manifests from these big secondary buckets that copy original manifests from other buckets and fail to update when they change.
I consider it bad practice to give these "mass appropriation buckets" a platform. They should be tagged as untrustworthy, or removed from scoop.sh completely.
Examples:
okibcn/ScoopMaster by @okibcn (with often very outdated versions)
samuelshi/ScoopBuckets-Unofficial by @samuelshi
cmontage/scoopbucket-third by @cmontage
anderlli0053/DEV-tools by @anderlli0053
The text was updated successfully, but these errors were encountered: