AI inference optimization