ibm-granite
/

granite-3.2-8b-alora-uncertainty

Text Generation

Model card Files Files and versions

kgreenewald commited on 21 days ago

Commit

6109ad8

·

verified ·

1 Parent(s): 047c9b6

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -25,7 +25,7 @@ adding the capability to provide calibrated certainty scores when answering ques
 ## Activated LoRA
 Activated LoRA (aLoRA) is a new low rank adapter architecture that allows for reusing existing base model KV cache for more efficient inference.
-Whitepaper
 [IBM Research Blogpost](https://research.ibm.com/blog/inference-friendly-aloras)

 ## Activated LoRA
 Activated LoRA (aLoRA) is a new low rank adapter architecture that allows for reusing existing base model KV cache for more efficient inference.
+[Paper](https://arxiv.org/abs/2504.12397)
 [IBM Research Blogpost](https://research.ibm.com/blog/inference-friendly-aloras)