Tough AI Interview Questions: Model Compression & LLM Inference