Abstract: Recently, improving the residual structure and designing efficient convolutions have become important branches of lightweight visual reconstruction model design. We have observed that the ...
Do computer vision foundation models learn the low-level characteristics of the human visual system?
Abstract: Computer vision foundation models, such as DINO or OpenCLIP, are trained in a self-supervised manner on large image datasets. Analogously, substantial evidence suggests that the human visual ...