update LLM for response generation
app.py (changed)
@@ -790,7 +790,8 @@ def respond(
     # Stream response
     response = client.chat.completions.create(
         messages=[{"role": "user", "content": prompt}],
-        model="
+        model="llama-3.1-8b-instant",
+        # model="llama-3.3-70b-versatile",
         stream=True,
     )
     cumulative_response = ""  # Keep track of the cumulative response
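For context, a minimal sketch of how a streamed completion like this is typically consumed. The model names in the diff are Groq-hosted Llama models, so the sketch assumes the Groq Python SDK (an OpenAI-compatible interface); the actual client construction, prompt handling, and the rest of respond() in app.py are not shown in this commit and may differ.

    # Minimal sketch, assuming the Groq SDK; app.py may build `client` and
    # `prompt` differently.
    from groq import Groq

    client = Groq()  # reads GROQ_API_KEY from the environment
    prompt = "Explain streaming completions in one sentence."

    # Stream response
    response = client.chat.completions.create(
        messages=[{"role": "user", "content": prompt}],
        model="llama-3.1-8b-instant",
        # model="llama-3.3-70b-versatile",
        stream=True,
    )

    cumulative_response = ""  # Keep track of the cumulative response
    for chunk in response:
        # Each streamed chunk carries an incremental piece of the reply
        delta = chunk.choices[0].delta.content
        if delta:
            cumulative_response += delta
            print(delta, end="", flush=True)

Because stream=True returns an iterator of chunks rather than a single message, the caller can yield cumulative_response as it grows, which is what makes the commented-out larger model (llama-3.3-70b-versatile) swappable without changing the consuming loop.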