Open comments for this post @zemu on aero-deuce · 10 days ago 2h 3m 38s logged adapter, gguf, and mlx (q4) all on huggingface I made a simple landing page hosted custom inference endpoint (might not work very well because im broke) should be able to run through services like Ollama and llama.cpp locally
Open comments for this post @zemu on aero-deuce · 10 days ago 2h 3m 38s logged adapter, gguf, and mlx (q4) all on huggingface I made a simple landing page hosted custom inference endpoint (might not work very well because im broke) should be able to run through services like Ollama and llama.cpp locally
Comments 0
No comments yet. Be the first!
Sign in to join the conversation.