Update: I figured it out and hosted it. Clear difference in capabilities visible.
I need atleast $100/mo to host 24x7 though.
TGI makes it trivial.
Can host openai-community/gpt2 (125M, 2019), EleutherAI/gpt-neox-20b (20B, 2022), gpt-3.5-turbo (175B?, 2020) and o3 (2T?, 2025).
Update: I figured it out and hosted it. Clear difference in capabilities visible.
I need atleast $100/mo to host 24x7 though.
TGI makes it trivial.
Can host openai-community/gpt2 (125M, 2019), EleutherAI/gpt-neox-20b (20B, 2022), gpt-3.5-turbo (175B?, 2020) and o3 (2T?, 2025).