1. What does "source code" even mean, in this context? I don't buy the requirement that all training data needs to also be open sourced for any of the things that people normally discuss with regard to open source topics (though that kind of openness would be good for different reasons).
Network weights and biases plus the model architecture are things that can be directly built upon, just like graphics are, and I would count a photo licensed under, say, MIT, as "open source" even if the JPEG codec on the camera which took the photo was not.
b. https://opensource.org/license/nasa1-3-php - "Notwithstanding any provisions contained herein, Recipient is hereby put on notice that export of any goods or technical data from the United States may require some form of export license from the U.S. Government. Failure to obtain necessary export licenses may result in criminal liability under U.S. laws. Government Agency neither represents that a license shall not be required nor that, if required, it shall be issued. Nothing granted herein provides any such export license."
Simply put, it should be reproducible with the published material. An ML model is definitely not reproducible without the training data (or a recipe to reproduce the training data).
This was said with source code and binaries in mind. Source code is easy to verify, modify and rebuild. Binaries are in practice non-modifiable and non-verifiable, except through costly reverse engineering.
Large models don’t work like that.
The only practical way to modify a model (finetune, LoRA, merge) is in its binary form. A source dataset may be interesting, but it's non-modifiable and non-reproducible in practice due to training costs. The rebuild process is usually non-deterministic, so "verify" is basically not an option.
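To illustrate the "modify in binary form" point: merging a LoRA adapter into a model needs only the weight matrices themselves, no training data. A minimal sketch (pure-Python matrices, hypothetical helper names, not any particular library's API):

```python
# Sketch: merging a low-rank (LoRA) adapter into base weights.
# W' = W + alpha * (B @ A) -- operates directly on the "binary" weights.

def matmul(B, A):
    rows, inner, cols = len(B), len(A), len(A[0])
    return [[sum(B[i][k] * A[k][j] for k in range(inner)) for j in range(cols)]
            for i in range(rows)]

def merge_lora(W, B, A, alpha=1.0):
    delta = matmul(B, A)  # low-rank update, same shape as W
    return [[w + alpha * d for w, d in zip(wr, dr)] for wr, dr in zip(W, delta)]

# 2x2 base weights, rank-1 adapter
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]   # 2x1
A = [[0.5, 0.5]]     # 1x2
merged = merge_lora(W, B, A)
print(merged)  # [[1.5, 0.5], [1.0, 2.0]]
```

Nothing in this operation touches the dataset or the training code, which is why the weights themselves are the practical unit of reuse.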
So technically true, practically complicated. Open weights and open dataset would be better terms.
The term open source fell prey to hype, ignorance and VC money in the case of AI models. The most incorrect uses are for literal binary blobs without a recipe to reproduce them (no training data in the majority of cases), and for products that are basically a thin layer over OpenAI calls, which are neither privacy-preserving, local, nor free.
The point is moot. Like anything else created by an automated computer process, model weights are not protected under US copyright law. The exclusion of machine-generated works from copyright protection has been pretty well established by the US Copyright Office. In fact, going off letters and rulings recently published by the US Copyright Office[1], even the outputs of generative models are excluded from copyright protection, regardless of the amount of human skill (e.g., prompt composition, parameter selection) involved in their production.
IANAL, but at most, publishing weights with a license may amount to little more than a ToS agreement, allowing distributors a bit more leeway in managing their legal/commercial relationship with recipients of said models. In other words, breaking the terms laid out in a text file entitled "LICENSE.txt" and distributed alongside a set of model weights may constitute a breach of contract, but it is in no way a copyright violation.
> What constitutes Open Source is not vague, btw. it's well defined by the OSI [1].
I'm pretty surprised to see objections to my claims here, I kinda thought I was just stating the obvious.
> Who is this organisation that they get to mandate the definition of the english language?
> What authority do they have to define the term “open source”?
This term has meaning, and while its meaning started with a group of people forming an organization and saying "this is what this means", its meaning doesn't derive from OSI (nor from some farcical aquatic ceremony). Its meaning comes from its popular use in language. For example, Wikipedia does describe this term and how it came to be used [1].
IMO it would be unclear and confusing to use the same term Open Source to describe both what it has historically described and how model weights like these are distributed. The term "Open Source" itself was coined to disambiguate merely "open" source from "free-as-in-freedom" source.
Probably bad editorializing on the submitter's part – I don't see open source being mentioned anywhere on Qualcomm's Hugging Face (although at least some models are distributed under an open source license, it seems – not unlike blobs in the Linux kernel).
Sorry, what's not open source about them? I've only checked a few models but they look to be under a BSD-3-clause license. The first few I looked at all have the same BSD-3 license [0] [1] [2].
Are you saying they've just repacked other existing models under their own banner but haven't opened sourced some other component?
From the link the person you're replying to posted:
> The program must include source code, and must allow distribution in source code as well as compiled form.
The compute graph with trained weights is very much a compiled form of the model. The source code would include everything needed to train that model and reproduce it.
> The source code would include everything needed to train that model and reproduce it.
You know these models are trained on internet scrape which contains copyrighted content, so the dataset can't be open sourced. It's either this or bad models.
In theory, you must have written some code to train the models and download the data. Just opening that code, plus adding logging to record the sources trained on, would achieve truly "open source" (anybody could then go scrape and train the same way you did and arrive at the same outcome/model).
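A sketch of what "logging the sources" could look like (hypothetical helper, real pipelines would be far more involved): record each document's URL and a content hash into a manifest, so a future re-scrape can be checked against the original corpus.

```python
import hashlib
import json

def add_to_manifest(manifest, url, content):
    """Record where a training document came from, plus a hash so a
    future re-scrape can verify it fetched the same bytes."""
    manifest.append({
        "url": url,
        "sha256": hashlib.sha256(content).hexdigest(),
        "bytes": len(content),
    })

manifest = []
add_to_manifest(manifest, "https://example.com/doc1", b"hello world")
print(json.dumps(manifest, indent=2))
```

Even without redistributing the copyrighted content itself, a manifest like this would make the training recipe reproducible in principle.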
I'm not saying "opening models is bad" – it's good. However, IMO it would be nice to have a semantic way to differentiate between those two.
> The program must include source code, and must allow distribution in source code as well as compiled form.
Can you reproduce these models? If not, then it's probably not open source. With a model, the closest analog to source code seems to be the training data. Is that all published?
ML training runs are not reproducible: GPUs are non-deterministic when doing large sums, the order in which operands are added changes the result, and thread execution times also depend on caching, which is hard to predict. If you want to force deterministic mode, be prepared for a huge slowdown.
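The order-dependence is just floating-point non-associativity, visible even without a GPU:

```python
# Floating-point addition is not associative, so summation order matters.
a, b, c = 1e16, -1e16, 1.0

left = (a + b) + c   # a and b cancel exactly first, then c survives
right = a + (b + c)  # c is absorbed: 1.0 is below the rounding step at 1e16

print(left, right, left == right)  # 1.0 0.0 False
```

On a GPU, thousands of threads accumulate partial sums in an order that varies run to run, so gradients (and therefore the final weights) drift between otherwise identical training runs.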
A lot of work at larger orgs is put into reproducible training runs. It's about the only way to debug 'did the small parameter tweak tank performance because of a hardware failure or is there something special about that parameter?'
Each time I see a comment like this I wonder who declared the open source initiative the single authority on this. I agree with the comment but this really rubs me the wrong way.
Oh, thanks. OSI has been around much longer than I expected.
This post[0] dives into who coined the term (spoiler: it predates OSI by a long, long time), but it’s reasonable that OSI popularized it alongside their specific definition.
Open Source is more like a designation. It is an agreed upon set of requirements that, if you change a requirement, it is something else. This is important.
Some things have legally protected designations such as 'ice cream'. Ice Cream has specific meaning in industry and even a grading system. If someone wants to make a cheaper product than the lowest grade of ice cream, they can't call it ice cream, they have to call it something like: frozen dairy dessert.
This makes it easy for people to understand what they are actually getting and paying for.
I wouldn't get indignant about mandating English language definitions. I would be indignant that AI companies are not fulfilling the requirements to call their models open source, and are providing a cheaper product than the abilities an actual open source model would provide.
It also has an obvious English meaning. The source of the model isn't fully open: it is not possible to inspect and modify the input used to build the models.
I'm reminded of med school when one day on rounds our internal medicine attending remarked that there are scores of treatments for hiccups, none of which have been shown to be superior to the others, which is why there are scores of treatments.
I found something that works for me: hold my breath for a short while, then start breathing slowly and keep it slow. Then I go back to breathing normally, and if the hiccups come back I repeat the process. Usually I can make them stop in a few minutes (sometimes 5+). On certain occasions I can't control them, e.g. after eating very spicy food without realizing it, or exceeding my max limit for spiciness (which used to be very high, but I don't partake as much).
There's probably an economic name for this, but it makes sense for non-leading companies, or companies in an adjacent field, to make public some of the secret sauce behind a leading company's proprietary assets – e.g. companies other than Google/Apple/MS funding or open-sourcing web browsers. AI models could be one of those things where we don't want one company to hold most of the marbles.
There is a pretty interesting system for running the models on actual mobile devices too. It seems they are using a cloud of mobile devices to make sure the models run on-device.
Followed the links to see what Whisper looks like here, and I'm kinda disappointed. They call their model [0] "Whisper-base" but the model checkpoint they're using is 'tiny-en'. There's a pretty significant performance difference between whisper-tiny and whisper-base.
Calling freely downloadable weights "Open Source" quite diminishes the term. It's laudable, so I'd hate to discourage it. But it's not Open Source.
What constitutes Open Source is not vague, btw. it's well defined by the OSI [1].
Let's call it open weights or freely downloadable weights or something.
EDIT: I was mistaken - they are downsampled/right-sized with half precision:
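For reference, "half precision" just means storing each weight in 16 bits instead of 32 or 64, trading accuracy for size. Python's struct module (which supports the IEEE 754 half-float `'e'` format) can demonstrate the round-trip loss:

```python
import struct

def to_fp16(x):
    """Round-trip a Python float through IEEE 754 half precision (2 bytes)."""
    return struct.unpack('<e', struct.pack('<e', x))[0]

w = 0.1
q = to_fp16(w)
print(w, q, abs(w - q))  # fp16 keeps only ~3 decimal digits of precision
```

That small per-weight error is usually tolerable for inference, which is why shipping fp16 (or smaller) checkpoints for mobile hardware is so common.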
BTW props to Qualcomm but these are "just" quantized versions of existing models? Useful, yes, but maybe not that novel.
[1] https://opensource.org/osd