Technology

34832 readers

This is the official technology community of Lemmy.ml for all news related to the creation and use of technology, and to facilitate civil, meaningful discussion around it.

Ask in DM before posting product reviews or ads. Otherwise, such posts are subject to removal.

Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post nazi/ped*/gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: Personal rants of Big Tech CEOs like Elon Musk are unwelcome (this does not include posts about their companies affecting a wide range of people)

6: No advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: Crypto-related posts, unless essential, are disallowed

founded 6 years ago

MODERATORS

MinutePhrase@lemmy.ml

1526

Tech Workers Coalition onboarding meeting - Thursday Sep 20th, 5pm PT - Register now! (us02web.zoom.us)

submitted 2 years ago by chobeat@lemmy.ml to c/technology@lemmy.ml

0 comments fedilink

1527

261

Musk says a 50% drop in ad revenue for Twitter is causing negative cash flow (www.phonearena.com)

submitted 2 years ago by wrath0110@midwest.social to c/technology@lemmy.ml

26 comments fedilink

1528

175

Tedd.it is Shutting Down (tedd.it)

submitted 2 years ago by dvdnet90@lemmy.world to c/technology@lemmy.ml

14 comments fedilink

1529

There's Now a Rapid, Accurate COVID-19 Air Detector (time.com)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

7 comments fedilink

Already, the research team is working on a device that could also identify influenza and RSV.

1530

Twitter vs. Threads: We’re not taking sides — but all hail the ‘fediverse’ (www.thestar.com)

submitted 2 years ago by Wilshire@lemmy.ml to c/technology@lemmy.ml

9 comments fedilink

1531

Lawyers using ChatGPT cite 6 fake cases (archive.is)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

1 comment fedilink

1532

Artificial Intelligence Is Making The Housing Crisis Worse (www.levernews.com)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

13 comments fedilink

1533

"Johnny Cash" - Nothing is true when everything is permitted (www.youtube.com)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

5 comments fedilink

1534

Petals - Run large language models at home, BitTorrent‑style (lemmy.world)

submitted 2 years ago* (last edited 2 years ago) by Blaed@lemmy.world to c/technology@lemmy.ml

8 comments fedilink

cross-posted from: https://lemmy.world/post/1535820

I'd like to share with you Petals: decentralized inference and finetuning of large language models

https://petals.ml/

https://research.yandex.com/blog/petals-decentralized-inference-and-finetuning-of-large-language-models

What is Petals?

Run large language models at home, BitTorrent‑style

Run large language models like LLaMA-65B, BLOOM-176B, or BLOOMZ-176B collaboratively — you load a small part of the model, then team up with people serving the other parts to run inference or fine-tuning. Single-batch inference runs at 5-6 steps/sec for LLaMA-65B and ≈ 1 step/sec for BLOOM — up to 10x faster than offloading, enough for chatbots and other interactive apps. Parallel inference reaches hundreds of tokens/sec. Beyond classic language model APIs — you can employ any fine-tuning and sampling methods, execute custom paths through the model, or see its hidden states. You get the comforts of an API with the flexibility of PyTorch.

Colab Link

GitHub Docs

Overview of the Approach

On a surface level, Petals works as a decentralized pipeline designed for fast inference of neural networks. It splits any given model into several blocks (or layers) that are hosted on different servers. These servers can be spread out across continents, and anybody can connect their own GPU! In turn, users can connect to this network as a client and apply the model to their data. When a client sends a request to the network, it is routed through a chain of servers that is built to minimize the total forward pass time. Upon joining the system, each server selects the best set of blocks to host based on the current bottlenecks within the pipeline. Below, you can see an illustration of Petals for several servers and clients running different inputs for the model.
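
To make the routing idea above concrete, here is a minimal sketch in Python of picking the fastest chain of servers for one forward pass. It is not Petals's actual routing code: the `Server` fields, the cost model (a fixed per-hop overhead plus a per-block compute time, both in milliseconds), and the example numbers are all assumptions made up for illustration. Each server hosts a contiguous range of blocks, and a simple dynamic program finds the cheapest sequence of hand-offs that covers every block in order.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Server:
    """Hypothetical description of one volunteer server (not a Petals type)."""
    name: str
    start: int         # first hosted block (inclusive)
    end: int           # one past the last hosted block (exclusive)
    hop_ms: int        # assumed fixed cost of sending activations to this server
    per_block_ms: int  # assumed compute time per block on this server


def fastest_chain(servers, num_blocks):
    """Return (total_ms, [(server, from_block, to_block), ...]) or None.

    best[b] holds the cheapest known way to finish blocks [0, b).
    All hand-offs move forward, so one left-to-right pass is enough.
    """
    best = [(math.inf, [])] * (num_blocks + 1)
    best[0] = (0, [])
    for b in range(num_blocks):
        cost, chain = best[b]
        if cost == math.inf:
            continue  # no chain reaches block b
        for s in servers:
            if not (s.start <= b < s.end):
                continue  # this server cannot take over at block b
            # The server may run any prefix of what it hosts, from b up to e.
            for e in range(b + 1, min(s.end, num_blocks) + 1):
                cand = cost + s.hop_ms + (e - b) * s.per_block_ms
                if cand < best[e][0]:
                    best[e] = (cand, chain + [(s.name, b, e)])
    total, chain = best[num_blocks]
    return None if total == math.inf else (total, chain)


# Made-up swarm: 8 blocks, four servers with overlapping ranges.
SERVERS = [
    Server("A", 0, 4, hop_ms=50, per_block_ms=10),
    Server("B", 4, 8, hop_ms=50, per_block_ms=10),
    Server("C", 0, 8, hop_ms=300, per_block_ms=20),
    Server("D", 2, 8, hop_ms=20, per_block_ms=12),
]

print(fastest_chain(SERVERS, 8))
# (158, [('A', 0, 4), ('D', 4, 8)])
```

In this toy swarm the single server that hosts everything (C) loses to a two-server chain, because its hop and per-block costs are high. A real swarm would also have to measure these costs and cope with servers joining and leaving, which this sketch ignores.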

Benchmarks

We compare the performance of Petals with offloading, as it is the most popular method for using 100B+ models on local hardware. We test both single-batch inference as an interactive setting and parallel forward pass throughput for a batch processing scenario. Our experiments are run on BLOOM-176B and cover various network conditions, from a few high-speed nodes to real-world Internet links. As you can see from the table below, Petals is predictably slower than offloading in terms of throughput but 3–25x faster in terms of latency when compared in a realistic setup. This means that inference (and sometimes even finetuning) is much faster with Petals, despite the fact that we are using a distributed model instead of a local one.

Conclusion

Our work on Petals continues the line of research towards making the latest advances in deep learning more accessible for everybody. With this work, we demonstrate that it is feasible not only to train large models with volunteer computing, but to run their inference in such a setup as well. The development of Petals is an ongoing effort: it is fully open-source (hosted at https://github.com/bigscience-workshop/petals), and we would be happy to receive any feedback or contributions regarding this project!

You can read the full article here

1535

157

Google lays off contractors who unionized last month | Engadget (www.engadget.com)

submitted 2 years ago by Barns@lemmy.world to c/technology@lemmy.ml

10 comments fedilink

1536

-2

Announcing Windows 11 Insider Preview Build 23475 (blogs.windows.com)

submitted 2 years ago* (last edited 2 years ago) by roon@lemmy.ml to c/technology@lemmy.ml

3 comments fedilink

TL;DR from the article: This build includes a handful of new features we’re beginning to roll out to Windows Insiders in the Dev Channel including a modernized File Explorer Home and address bar, Dynamic Lighting, and support for Emoji 15. As part of introducing these new features, we’ve also added some new known issues. We’re also releasing a new Microsoft Store update.

1537

July 14: Meta's entry into image generation and editing (sh.itjust.works)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

3 comments fedilink

https://www.maginative.com/article/meta-unveils-cm3leon-a-breakthrough-ai-model-for-advanced-text-to-image-generation-and-image-understanding/

1538

Wireshark Is 25: The email that started it all and lessons learned along the way (blog.wireshark.org)

submitted 2 years ago by BrikoX@lemmy.zip to c/technology@lemmy.ml

2 comments fedilink

1539

Microsoft takes pains to obscure role in 0-days that caused email breach (arstechnica.com)

submitted 2 years ago by BrikoX@lemmy.zip to c/technology@lemmy.ml

3 comments fedilink

1540

archive.org flash emulation quality just increased (mastodon.archive.org)

submitted 2 years ago by can@sh.itjust.works to c/technology@lemmy.ml

1 comment fedilink

1541

The Baconing of AI Imagery (youtu.be)

submitted 2 years ago* (last edited 2 years ago) by gnarly@lemmy.world to c/technology@lemmy.ml

1 comment fedilink

I'm a former VFX artist for Robot Chicken and a few other [adultswim] shows. These are a few of my initial thoughts on AI imagery after 1 year in the can. Excuse my VTuber model: since I'm not fully condemning AI, I've had to hide my identity a bit after some nastiness was thrown my way online. Won't stop me from making content though. Take care!

1542

The Questionable Engineering of Oceangate (yewtu.be)

submitted 2 years ago by HiddenLayer5@lemmy.ml to c/technology@lemmy.ml

17 comments fedilink

1543

Facial recognition surveillance in São Paulo could worsen racism (www.aljazeera.com)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

0 comments fedilink

More than 90% of people arrested based on facial recognition are Black. In the state of Rio de Janeiro, unjust arrests involving Black individuals reached 81%.

1544

Light-based “LiFi” is stunningly fast, notably fragile—and now standardized (arstechnica.com)

submitted 2 years ago by BrikoX@lemmy.zip to c/technology@lemmy.ml

16 comments fedilink

1545

Using bigger AI training data sets may produce more racist results (archive.is)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

1 comment fedilink

Contrary to Silicon Valley wisdom, training AIs on larger data sets could worsen their tendency to replicate societal biases and racist stereotypes

1546

AI-powered companion robots could end loneliness in older adults (interestingengineering.com)

submitted 2 years ago by cyu@sh.itjust.works to c/technology@lemmy.ml

10 comments fedilink

A new study reports that companion robots with artificial intelligence may one day help alleviate the loneliness epidemic. The Surgeon General says loneliness may be as pernicious as cigarettes.

1547

150

AI panic is a marketing strategy (lemmy.ml)

submitted 2 years ago by chobeat@lemmy.ml to c/technology@lemmy.ml

40 comments fedilink