V1 vision

This repository contains visual processing packages in Go (golang), focused on providing efficient V1 (primary visual cortex) level filtering of images, with the output then suitable as input for neural networks.

As of Dec, 2025, the system supports full GPU-based operations using GoSL. See below for design considerations.

Two main types of filters are supported:

Gabor filters simulate V1 simple-cell responses in terms of an oriented sine wave times a gaussian envelope that localizes the filter in space. This produces an edge detector that detects oriented contrast transitions between light and dark. In general, the main principle of primary visual filtering is to focus on spatial (and temporal) changes, while filtering out static, uniform areas.
DoG (difference of gaussian) filters simulate retinal On-center vs. Off-center contrast coding cells -- unlike gabor filters, these do not have orientation tuning. Mathematically, they are a difference between a narrow (center) vs wide (surround) gaussian, of opposite signs, balanced so that a uniform input generates offsetting values that sum to zero. In the visual system, orientation tuning is constructed from aligned DoG-like inputs, but it is more efficient to just use the Gabor filters directly. However, DoG filters capture the "blob" cells that encode color contrasts.

The v1vision package contains general-purpose filtering code that applies (convolves) any given filter with a visual input. It also supports converting an image.Image into a tensor.Float32 tensor which is the main data type used in this framework. It also supports max-pooling for efficiently reducing the dimensionality of inputs.

The kwta package provides an implementation of the feedforward and feedback (FFFB) inhibition dynamics (and noisy X-over-X-plus-1 activation function) from the Leabra algorithm to produce a k-Winners-Take-All processing of visual filter outputs -- this increases the contrast and simplifies the representations, and is a good model of the dynamics in primary visual cortex.

To more fully leverage the GPU parallel processing, there is an NData data-parallel parameter that runs n copies of each operation in parallel. This should correspond to the data-parallel batch size parameter in the simulation, so the entire batch is processed in one step. This parameter must be set at the outset (on V1Vision object) to ensure consistent memory allocations for all operations.

GoSL design

The GoSL (Go as a shader language) system is maximally efficient if everything can be configured statically in memory at the outset, and then each iteration just pushes up the new image and retrieves the final filtered results. This is accomplished by effectively compiling a programmed sequence of operations into the Ops list, and configuring everything to hold all the intermediate data results from each Op. At run-time, each Op is uploaded to the GPU in turn, and provides the control params for running that operation.

Name		Name	Last commit message	Last commit date
Latest commit History 192 Commits
.github/workflows		.github/workflows
colorspace		colorspace
dog		dog
examples		examples
fffb		fffb
gabor		gabor
kwta		kwta
motion		motion
nproc		nproc
nxx1		nxx1
v1std		v1std
v1vision		v1vision
vxform		vxform
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
doc.go		doc.go
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

V1 vision

GoSL design

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

emer/v1vision

Folders and files

Latest commit

History

Repository files navigation

V1 vision

GoSL design

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages