1 article in this category
llama.cpp server introduces router mode, enabling dynamic loading and switching between multiple models without restarts.