Square convolution is a default unit in convolutional neural networks as...
Unlike language tasks, where the output space is usually limited to a se...
Differentiable architecture search (DARTS) has attracted much attention ...
We present techniques for scaling Swin Transformer up to 3 billion param...
The vision community is witnessing a modeling shift from CNNs to Transfo...