Researchers at Princeton and Stanford have developed CALDERA, a novel algorithm to compress large language models (LLMs) by ...