Blog - 2019-06-03-overview

# resize pdfs
gs -sDEVICE=pdfwrite -dPDFSETTINGS=/ebook -q -o output.pdf file.pdf
gs -sDEVICE=pdfwrite -dPDFSETTINGS=/prepress -q -o output.pdf file.pdf

Camera

2018_coded_two_bucket_cameras_for_computer_vision
- abstract
  - coded two-bucket imaging (C2B) for 3D shape estimation. the sensor modulates light arriving at each pixel and outputs 2 images per pixel. When coupled with dynamic light sources acquire illumination mosaics, which can be processsed to acquire live disparity/normal maps of dynamic objects.
- questions
  - spatial multiplexing (F frames -> 1 frame that is spatially multiplex)
    - achieved with a tiling and a correspondence of frame to pixel sampling scheme
    - what is the motivation for this
      - photometric stereo needs several photos, needs a sequence of this …
      - but scene always move in a video, adapt traditional algoritm (stereo) work under this framework (reconstruction in 1 shot)
      - similar to color imaging (several spectrum 1 in shot)
      - potential for different tiles (4x4 -> recover 16 frames)

Points

What metrics used in demosaicking papers ?
- color
  - MSE/PSNR: error between reference and reconstructed images
  - S-CIELAB: perceptual color fidelity (http://scarlet.stanford.edu/~brian/scielab/introduction.html)
- artifacts
  - a measure of zipper effect near sharp edges(https://pdfs.semanticscholar.org/9309/59339e8b69b90d18b479dbfa06049f4a5182.pdf)
  - a measure of aliasing (false color)
How confident are we in adapting bayer CFA interpolation methods to structured light images ?
- assumptions that cannot be assumed
  - constant/smooth hue assumption (spectrally, hue is constant/smooth inside boundaries of objects) since the image is grayscale
  - each pixel has
- assumptions that can be exploited
  - homogeneity assumption (spatially, neighboring pixels morel likely to have similar colors)
- some demosaicing algorithm exploit structure of bayer color filter array, which is not the case in our case.
  - sequential demosaicing methods (interpolate G channel first, then interpolate R,G) relies on the rationale tht since G channel is sampled more it is less aliased. This is not true for the two bucket camera
http://people.duke.edu/~sf59/TIP_Demos_Final_Color.pdf
- considers super-resolution and demosaicing as the same problem

hyperspectral imaging

https://link.springer.com/article/10.1007/s00138-018-0965-4
- hypterspectral imaging mosaic design
- discusses
  - how much hyperspectral imaging benefit from spectral and spatial correlation
https://link.springer.com/article/10.1007/s11042-018-6396-4
- http://sesar.di.unimi.it/jdemdb/
- a new dataset (16-bit images) for benchmarking demosaicing and denoising algorithms
Generating Training Data for Denoising Real RGB Images via Camera Pipeline Simulation
- https://arxiv.org/pdf/1904.08825.pdf
- more realistic dataset generation with realistic noise characteristics

The list of state of art methods

05.30

tabulate the psnrs in a matrix …
- only >= 2015 papers
things to keep track of
- brief description of methods
  - pro/con
  - interest
- feasibilty/bottleneck w.r.t. c2b camera
  - easy to use ?
some observation on datasets
- deep learning based ones could be trained with several hundred to 2k images
- however, this is a problem for c2b cameras, since the dataset would be fixed to particular tiling, and a particular reconstruction pattern
- just need full-res images ….
- different pattern for each frame

05.31

mosaic design could be really cool as a second step
- Deep Joint Design of Color Filter Arrays and Demosaicing
  - automatic design of filter arrangement with an autoencoder
hook up or implement method myself ?
- if its easiest to do this …
adapt the neural net for two bucket camera
- i.e. input is two bucket camera
- can do demosaic and demultiplexing at same time…
decide which one is most amenable …
pick a few that is reasonable, pros and cons, what does it take to get there
- write the email
- propose a few
short: 2014_flexISP full: FlexISP: A Flexible Camera Image Processing Framework url: http://www.cs.ubc.ca/labs/imager/tr/2014/FlexISP/FlexISP_Heide2014_lowres.pdf notes: - end-to-end image processing that enforce image priors as proximal operators and use ADMM/primal-dual for optimization - end-to-end reduces error introduced in each steps of image processing, as each stage is not independent - applied to demosaicing, denoising, deconvolution, and a variety of reconstruction tasks evaluate: - recontruction based, no need for large datasets - classical regularizers that have proven to work well - choice of prior is shown to influence performance, so the choice of prior is important but ad hoc - no principled ways to pick solver parameters, i.e. the weight of regularizers - nonconvexity of the regularizers makes the optimization not guaranteed to converge to global optimum