Hacker News: sp332's comments

I was browsing test kits where you send a sample back to a lab. They're $79 - $300 and advertise 1 or 2 parts per trillion. Looks like getting to 0.02 ppt requires some very specialized equipment, and would probably be optimized for continuous monitoring of a water supply.


Fine tuning can be useful if you need to generate lots of output in a particular format. You can fine-tune on formatted messages, and then the model will generate that automatically. That could save a bunch of tokens explaining the output format in every prompt.
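To make the idea concrete, here is a minimal sketch of what such training data could look like, assuming an OpenAI-style chat JSONL format (the field names and examples are illustrative, not from any specific provider's docs). The assistant replies already use the target output format, so a model fine-tuned on them learns to emit it without per-prompt formatting instructions:

```python
import json

# Hypothetical fine-tuning examples: each assistant turn is already in the
# desired output format (JSON here), so no format instructions are needed
# in the prompt after fine-tuning.
examples = [
    {
        "messages": [
            {"role": "user", "content": "Extract the city: 'Flights to Paris are cheap.'"},
            {"role": "assistant", "content": '{"city": "Paris"}'},
        ]
    },
    {
        "messages": [
            {"role": "user", "content": "Extract the city: 'I just moved to Osaka.'"},
            {"role": "assistant", "content": '{"city": "Osaka"}'},
        ]
    },
]

# Write one JSON object per line, the usual shape for fine-tuning datasets.
with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")
```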


You can use structured generation instead of fiddling with the prompt, which is unreliable. https://github.com/outlines-dev/outlines


Does this Python package control the LLM using something other than text? Or is the end result still that the package wraps your prompt with additional instructions that become part of the prompt itself?


Looks like it actually changes how you do token generation to conform to a given context-free grammar. It's a way to structure how you sample from the model rather than a tweak to the prompt, so it's more efficient and guarantees that the output matches the formal grammar.

There's a reference to the paper that describes the method at the bottom of the README: https://arxiv.org/pdf/2307.09702


The output of the LLM is not just one token, but a statistical distribution across all possible output tokens. The tool you use to generate output will sample from this distribution with various techniques, and you can put constraints on it like not being too repetitive. Some of them support getting very specific about the allowed output format, e.g. https://github.com/ggerganov/llama.cpp/blob/master/grammars/... So even if the LLM says that an invalid token is the most likely next token, the tool will never select it for output. It will only sample from valid tokens.
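The masking idea can be sketched in a few lines of plain Python. This is a toy illustration of the mechanism behind grammar-constrained sampling (as in llama.cpp grammars or Outlines), not any library's actual API; the vocabulary, logits, and one-step "grammar" are all made up:

```python
import math

# Toy vocabulary and raw model scores. "<junk>" gets the highest logit,
# i.e. the model thinks it is the most likely next token.
vocab = ["{", "}", '"name"', ":", "hello", "<junk>"]
logits = [1.0, 0.5, 2.0, 0.3, 0.8, 5.0]

def allowed_tokens(generated_so_far):
    # Hypothetical grammar step: a JSON object must start with "{".
    # A real implementation tracks full grammar state; elided here.
    if not generated_so_far:
        return {"{"}
    return set(vocab)

def constrained_greedy_step(logits, vocab, generated):
    ok = allowed_tokens(generated)
    # Mask disallowed tokens to -inf so they can never be selected,
    # then pick the best remaining token.
    masked = [l if t in ok else -math.inf for t, l in zip(vocab, logits)]
    best = max(range(len(vocab)), key=lambda i: masked[i])
    return vocab[best]

first = constrained_greedy_step(logits, vocab, [])
```

Even though "<junk>" has the highest logit, the masked sampler can only return "{", which is exactly the guarantee described above.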


No, it limits which tokens the LLM can output. The output is guaranteed to follow the schema.


How much money do you have?


What does it cost me to reliably (p = 0.95) differentiate 0.04 ppt from 0.02 ppt?
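As a back-of-envelope sketch of what that question implies: assuming Gaussian measurement noise and a one-sided z-test (the assay standard deviation below is an invented illustrative number, not a real instrument spec), the standard power formula gives the number of replicate measurements needed:

```python
import math
from statistics import NormalDist

# Illustrative numbers only: distinguish 0.04 ppt from 0.02 ppt with
# 95% confidence and 95% power, given an assumed per-measurement SD.
delta = 0.04 - 0.02   # difference to detect, ppt
sigma = 0.03          # assumed per-measurement SD, ppt (hypothetical)
alpha, power = 0.05, 0.95

z_alpha = NormalDist().inv_cdf(1 - alpha)
z_beta = NormalDist().inv_cdf(power)

# n = ((z_alpha + z_beta) * sigma / delta)^2, rounded up.
n = math.ceil(((z_alpha + z_beta) * sigma / delta) ** 2)
```

With these made-up numbers you would need on the order of a couple dozen replicates per sample, which is part of why lab tests at that precision get expensive.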


In water? Doesn't seem like an issue.


In pure water it's not an issue. In drinking water, which has all sorts of "stuff" in it, getting to that level of precision isn't easy.


The doctor in the documentary tells him that his liver is damaged like an alcoholic's. He just let people assume it was from the food.


He said it was this specific performance that convinced Weird Al that he was the right guy. Also the movie is great.


Since the 777 is older, and this has never happened, and they are giving airlines 5 years to make the change, it doesn't seem likely to be an actual problem.


Let's hope so; it's not a problem you can afford to be around even once.


It's not "reportedly".


So it wasn't reported?


Not by any serious outlet. It's a Twitter rumor started by a nobody... Another reason why letting people pay for blue checks was an incomprehensibly stupid idea.


As the body of the article says repeatedly, it's only a rumor.


It’s rumoredly.


I don't remember Currents or Spaces.


It's similar to the deal they got from Microsoft: >$10B on paper, but a lot of that is in the form of Azure credits.


