The "why would they make this" people don't understand how important this type of research is. It's important to show what's possible so that we can be ready for it. There are many bad actors already pursuing similar tools if they don't have them already. The worst case is being blindsided by something not seen before.
The rest of the budget kind of sucks but this part makes sense. If you're making significant profits off of users in a country you should have to pay some of that back. All countries should have this.
Cohere's command-r models are trained for exactly this type of task. The real struggle is finding a way to feed relevant sources into the model. There are plenty of projects that have attempted it but few can do more than pulling the first few search results.
I don't think the term open-source can be applied to model weights. Even if you have the exact data, config, trainer and cluster it's basically impossible to reproduce an exact model. Calling a model "open" sort of works but then there's the distinction between open for research and open for commercial use. I think it's kind of similar to the "free" software distinction. Maybe there's some Latin word we could use.
Your best bet would probably be to get a used office PC to put the card in. You'll likely have to replace the power supply and maybe swap the storage but with how much proper external enclosures go for the price might not be too different. Some frameworks don't support direct GPU loading so make sure that you have more ram than vram.
An arm soc won't work in most cases due to a lack of bandwidth and software support. The only board I know of that can do it is the rpi5 and that's still mostly a poc.
In general I wouldn't recomend a titan x unless you already have one because it's been deprecated in cuda, so getting modern libraries to work will be a pain.
I really like the simplicity and formatting of stock pacman. It's not super colorful but it's fast and gives you all of the info you need. yay (or paru if you're a hipster) is the icing on top.
The "AI PC" specification requires a minimum of 40TOPs of AI compute which is over double the 18TOPs in the current M3s. Direct comparison doesn't really work though.
What really matters is how it's made available for development. The Neural engine is basically a black box. It can't be incorporated into any low level projects because it's only made available through a high-level swift api. Intel by comparison seems to be targeting pytorch acceleration with their libraries.
This article is grossly overstating the findings of the paper. It's true that bad generated data hurts model performance, but that's true of bad human data as well. The paper used opt125M as their generator model, a very small research model with fairly low quality and often incoherent outputs. The higher quality generated data which makes up a majority of the generated text online is far less of an issue. The use of generated data to improve output consistency is a common practice for both text and image models.
It's size makes it basically useless. It underperforms models even in it's active weight class. It's nice that it's available but Grok-0 would have been far more interesting.
I feel like the whole Reddit AI deal is a trap. If any real judgment comes down about data use Reddit is an easy scapegoat. There was basically nothing stopping them from scraping the site for free.
Don't buy a Chromebook for linux. While driver support usually isn't an issue, the alternative keyboard layout is terrible for most applications. To even get access to all of the normal keys that many applications expect you need to configure multi-key shortcuts which varies in complexity based on your DE. In most cases it will also void your warranty because of the custom firmware requirement.
I got locked out of my now 8+ year old account because I had set it up with an old ISP provided email which has since been deactivated. I can't migrate because I have to verify with the email and I can't change the email without setting up security questions, which also requires the email. Support can do nothing.
According to a recent tweet shared by AI enthusiast Nick St. Pierre, the alleged theft occurred last Saturday. It is claimed that employees from Stability AI infiltrated Midjourney's database and stole all prompt and image pairs, an action that also caused a 24-hour outage. In response, MJ reportedly banned all Stable Diffusion...
I don't think they care about the images being used, just the disruption of service. It's pretty clear that this wasn't a coordinated thing from Stability and was at most a lone individual acting in bad faith.
It's pretty ironic though that the company that practices mass scraping has no rate limits to prevent outages due to mass scraping.
To compile optimal video, audio, and subtitle track combinations of videos for my media library, I've found MPC-HC's millisecond counter and frame skip features useful for finding the exact offset between different video and audio tracks. After using MKVToolNix to combine the video track of an MP4 file with the delay-adjusted...
There should be no difference because the video track hasn't been touched. Some software will display the length of the longest track rather than the length of the main video track. It's likely that the the audio track was originally longer than the video track and because of the offset it's now shorter.
You can use tools like ffmpeg and mediainfo to count the actual frames in each to verify.
They are asking a federal judge to say yes to this, specifically:
Developing or distributing software, including Yuzu, that in its ordinary course functions only when cryptographic keys are integrated without authorization, violates the Digital Millennium Copyright Act’s prohibition on trafficking in devices that circumvent effective technological measures, because the software is primarily designed for the purpose of circumventing technological measures.
So I think they're definitely intending to set precedent with this case, though this settlement hasn't been accepted by the court yet.
I believe USB-C is the only connector supported for carrying DisplayPort signals other than DisplayPort itself.
The biggest issue with USB-C for display in my opinion is that cable specs vary so much. A cable with a type c end could carry anywhere from 60-10000MB/s and deliver anywhere from 5-240W. What's worse is that most aren't labeled, so even if you know what spec you need you're going to have a hell of a time finding it in a pile of identical black cables.
Not that I dislike USB-C. It's a great connector, but the branding of USB has always been a mess.
From the abstract: "Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single parameter (or weight) of the LLM is ternary {-1, 0, 1}."...
This looks like an ad. They go on about what their proprietary detection method found without any details about how it came to these conclusions or even how they generated the test data. They give 0 actual examples for any of their claims.
Hello internet users. I have tried gpt4all and like it, but it is very slow on my laptop. I was wondering if anyone here knows of any solutions I could run on my server (debian 12, amd cpu, intel a380 gpu) through a web interface. Has anyone found any good way to do this?
Koboldcpp should allow you to run much larger models with a little bit of ram offloading. There's a fork that supports rocm for AMD cards: https://github.com/YellowRoseCx/koboldcpp-rocm
Make sure to use quantized models for the best performace, q4k_M being the standard.
From my experience the model is pretty bad compared to both 7B llama2 and mistral. It's also larger than both. It might be caused by bad instruction tuning, but overall my expectations are pretty low.
Reddit is a ‘smaller, more volatile’ Twitter, says Big Technology’s Alex Kantrowitz::Alex Kantrowitz, Big Technology founder, joins 'Squawk Box' to discuss Reddit's decision to go public, the company's journey to IPO, Sam Altman's stake in the company, and more.
The ~400USD price tag is really impressive, but the big thing with these folding phones is the reliability of the hinge. It will be interesting to see how it fares when proper reviews come in.
“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI::Despite the fact that Nvidia is now almost the main beneficiary of the growing interest in AI, the head of the company, Jensen Huang,...
This isn't necessarily about just hardware. Current ML architectures and inference engines are far from being at peak efficiency. Just last year we saw 20x speedups for llm inference on some hardware. "a million times" is obviously hyperpole though.
I like to run qBittorrent on a Linux laptop to try and minimize the risk of Microsoft tracking what I'm doing and to protect my main system from viruses. I'm having an issue with it though. When I go into Advanced and select Network Interface, Proton VPN is not an option there. I only get lo, eno1, wlp4s0, ipv6leakintrf0, and...
Weird that he's suddenly interested in the UAE considering that he'd be persecuted there. Couldn't have anything to do with that money he's trying to raise...
Vision Pro EyeSight feature doesn't really work, argues Macworld::Vision Pro’s EyeSight feature is something Apple has stressed as a key product differentiator over rival headsets, and as a...
This is an article about another article, some top tier journalism. They're right about the external display though. I've yet to see a positive comment about it, seems like just a weird gimmick that drains the already short battery life.
Something on the lines of if your company facility is using over X amount of energy the majority of that has to be from a green source such as solar power. What would happen and is this feasible or am I totally thinking about this wrong...
There is no such thing as "green" energy, all energy has an environmental extraction/capture cost. Crypto has insane per user power usage, AI isn't quite as bad but it's still much higher than normal websearch. Both should be used sparingly in cases where they actually make sense.
OpenAI CEO Sam Altman is in talks with investors, including from the United Arab Emirates, to raise between $5 trillion to $7 trillion in funding. The goal, according to a report in The Wall Street Journal, is to increase the world's chip manufacturing capacity and enhance AI capabilities....
Tiny plastic shards found in human testicles, study says ( www.cnn.com )
Critics question tech-heavy lineup of new Homeland Security AI safety board ( arstechnica.com )
Steam will stop issuing refunds if you play two hours of a game before launch day ( www.theverge.com )
Closing the early access loophole.
DuckDuckGo AI Chat ( duckduckgo.com )
DDG is now offering free/private AI chat using several models.
Microsoft’s VASA-1 can deepfake a person with one photo and one audio track ( arstechnica.com )
Zuckerberg says Meta's Llama 3 is really good but no chatbot is sophisticated enough to be an 'existential' threat — yet ( www.businessinsider.com )
Canada to start taxing tech giants in 2024 despite U.S. complaints ( www.bnnbloomberg.ca )
GPT-4 performance comparable with physicians on official medical board residency examinations. Model performance near or above official passing rate in all medical specialties tested ( ai.nejm.org )
Instagram will blur nudes in messages sent to minors ( www.theverge.com )
The tech industry can’t agree on what open-source AI means. That’s a problem. ( www.technologyreview.com )
Meta’s AI image generator really struggles with the concept of interracial couples | CNN Business ( www.cnn.com )
Meta’s AI image generator is coming under fire for its apparent struggles to create images of couples or friends from different racial backgrounds.
‘The machine did it coldly’: Israel used AI to identify 37,000 Hamas targets ( www.theguardian.com )
Dock GPU to Laptop or to small SOC?
Afaik most LLMs run purely on the GPU, dont they?...
What is the most visually pleasing package manager (in terminal)?
MIT scientists have just figured out how to make the most popular AI image generators 30 times faster ( www.livescience.com )
How well can LLMs solve chess puzzles? ( github.com )
Each LLM is given the same 1000 chess puzzles to solve. See puzzles.csv. Benchmarked on Mar 25, 2024....
Mistral 7B v0.2 Base (released at SHACK15sf hackathon) ( twitter.com )
GitHub: https://github.com/mistralai-sf24/hackathon...
Reddit is going public. Will its unruly user base revolt? ( www.vox.com )
Microsoft’s first AI PCs are the Surface Pro 10 and Surface Laptop 6 for businesses ( www.theverge.com )
Generative AI will eventually poison itself ( www.xda-developers.com )
Grok-1 chatbot code released – open source or open Pandora's box? ( www.theregister.com )
The FTC is probing Reddit’s AI licensing deals ( www.engadget.com )
cross-posted from: https://slrpnk.net/post/7669534...
Fanless linux laptop
I'm looking for an Apple MacBook Air M2 alternative that could run Linux....
To buy no longer means anything :( ( youtu.be )
I’ve just watched the video. I find it pretty outrageous. The word about it should spread.
Midjourney Accuses Stability AI of Image Theft, Bans Its Employees ( 80.lv )
According to a recent tweet shared by AI enthusiast Nick St. Pierre, the alleged theft occurred last Saturday. It is claimed that employees from Stability AI infiltrated Midjourney's database and stole all prompt and image pairs, an action that also caused a 24-hour outage. In response, MJ reportedly banned all Stable Diffusion...
Video length variation when converting MP4 file to MKV
To compile optimal video, audio, and subtitle track combinations of videos for my media library, I've found MPC-HC's millisecond counter and frame skip features useful for finding the exact offset between different video and audio tracks. After using MKVToolNix to combine the video track of an MP4 file with the delay-adjusted...
Nintendo Switch emulator, Yuzu, developers settling lawsuit from Nintendo with $2.4M payout, handing over its domains, and agreeing "Yuzu [is] primarily designed to circumvent [DRM]". ( www.theverge.com )
This also includes ceasing development and destroying their copies of the code....
The HDMI Forum rejected AMD's open source HDMI 2.1 implementation ( www.gamingonlinux.com )
[Update] Version 0.19 Upgrade - Done!
[Update - March 3rd 2024 , 21:14 CET]...
[Paper] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits ( huggingface.co )
From the abstract: "Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single parameter (or weight) of the LLM is ternary {-1, 0, 1}."...
New report: 60% of OpenAI model's responses contain plagiarism ( www.axios.com )
A new report from plagiarism detector Copyleaks found that 60% of OpenAI's GPT-3.5 outputs contained some form of plagiarism....
Self hosted LLM
Hello internet users. I have tried gpt4all and like it, but it is very slow on my laptop. I was wondering if anyone here knows of any solutions I could run on my server (debian 12, amd cpu, intel a380 gpu) through a web interface. Has anyone found any good way to do this?
Google Is Giving Away Some of the A.I. That Powers Chatbots ( lemmy.dbzer0.com )
How important do yall expect this to be?
Reddit is a ‘smaller, more volatile’ Twitter, says Big Technology’s Alex Kantrowitz ( www.cnbc.com )
Reddit is a ‘smaller, more volatile’ Twitter, says Big Technology’s Alex Kantrowitz::Alex Kantrowitz, Big Technology founder, joins 'Squawk Box' to discuss Reddit's decision to go public, the company's journey to IPO, Sam Altman's stake in the company, and more.
[Thread, post or comment was deleted by the author]
ZTE Libero Flip launches as a particularly affordable foldable with a round secondary display and Snapdragon 7 Gen 1 ( www.notebookcheck.net )
Reddit has reportedly signed over its content to train AI models ( mashable.com )
OpenAI introduces Sora, its text-to-video AI model ( www.theverge.com )
https://openai.com/sora...
“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI ( gadgettendency.com )
“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI::Despite the fact that Nvidia is now almost the main beneficiary of the growing interest in AI, the head of the company, Jensen Huang,...
Can't bind Proton VPN to qBittorrent on Linux
I like to run qBittorrent on a Linux laptop to try and minimize the risk of Microsoft tracking what I'm doing and to protect my main system from viruses. I'm having an issue with it though. When I go into Advanced and select Network Interface, Proton VPN is not an option there. I only get lo, eno1, wlp4s0, ipv6leakintrf0, and...
Your AI Girlfriend Is a Data-Harvesting Horror Show ( gizmodo.com )
Stopping enslavement of trafficked children ❌ Perfect place to guide the regulations of the world ✅ ( sh.itjust.works )
(Low effort - seeking appropriate meme template)...
Vision Pro EyeSight feature doesn't really work, argues Macworld ( 9to5mac.com )
Vision Pro EyeSight feature doesn't really work, argues Macworld::Vision Pro’s EyeSight feature is something Apple has stressed as a key product differentiator over rival headsets, and as a...
Because AI and Crypto use so much electricity, what if a law was made that they had to power it with green energy?
Something on the lines of if your company facility is using over X amount of energy the majority of that has to be from a green source such as solar power. What would happen and is this feasible or am I totally thinking about this wrong...
OpenAI wants to raise 5-7 trillion dollars. Yes, Trillion ( decrypt.co )
OpenAI CEO Sam Altman is in talks with investors, including from the United Arab Emirates, to raise between $5 trillion to $7 trillion in funding. The goal, according to a report in The Wall Street Journal, is to increase the world's chip manufacturing capacity and enhance AI capabilities....