30–31 May 2024
Aalto University Campus
Europe/Helsinki timezone

Deploying Open Source LLMs for on-site usage

Not scheduled
20m
Aalto University Campus

Otaniementie 13, 02150 Espoo, Finland
On-going projects Track 1

Speaker

Thomas Pfau (Aalto University)

Description

LLMs have become a tool used by many researchers for a wide variety of tasks, and several libraries facilitate access to the most common models. At the same time, many researchers' workstations lack the capacity to run LLMs locally, and researchers are hesitant to feed potentially sensitive data to models hosted on external web services such as Azure or OpenAI. We have set up a local LLM deployment based on Kubernetes, FastAPI and llama.cpp. The deployment provides several models along with a self-service checkout that lets researchers create their own API keys. While the service is currently not intended for high-throughput inference, it can serve as a testing ground and can easily be extended.
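As a rough illustration of how such a deployment is typically consumed, the sketch below builds an OpenAI-compatible chat-completion request of the kind llama.cpp's built-in HTTP server accepts. The base URL, model name and API key are placeholders, not the actual Aalto service details, and the exact request schema of the described deployment is an assumption here.

```python
import json
import urllib.request

# Placeholder values: the real service URL, model names and key format
# are not specified in the abstract.
BASE_URL = "http://localhost:8000"  # assumed address of the local deployment
API_KEY = "your-api-key"            # obtained via the self-service checkout

def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Build an OpenAI-compatible chat-completion request, as served by
    llama.cpp's HTTP server behind an API-key-checking gateway."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {API_KEY}",
        },
        method="POST",
    )

req = build_request("example-model", "Summarize our meeting notes.")
print(req.full_url)
```

Because the endpoint follows the OpenAI wire format, existing client libraries can usually be pointed at it by overriding the base URL, which keeps researchers' code portable between the local service and external providers.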

Primary author

Thomas Pfau (Aalto University)

Presentation materials

There are no materials yet.