OTAS: An Elastic Transformer Serving System via Token Adaptation

Publication
IEEE International Conference on Computer Communications (INFOCOM) (CCF-A)