2019-06-03-overview

Posted on June 3, 2019
# resize pdfs
gs -sDEVICE=pdfwrite -dPDFSETTINGS=/ebook -q -o output.pdf file.pdf
gs -sDEVICE=pdfwrite -dPDFSETTINGS=/prepress -q -o output.pdf file.pdf

Camera

  • 2018_coded_two_bucket_cameras_for_computer_vision
    • abstract
      • coded two-bucket imaging (C2B) for 3D shape estimation. the sensor modulates light arriving at each pixel and outputs 2 images per pixel. When coupled with dynamic light sources acquire illumination mosaics, which can be processsed to acquire live disparity/normal maps of dynamic objects.
    • questions
      • spatial multiplexing (F frames -> 1 frame that is spatially multiplex)
        • achieved with a tiling and a correspondence of frame to pixel sampling scheme
        • what is the motivation for this
          • photometric stereo needs several photos, needs a sequence of this …
          • but scene always move in a video, adapt traditional algoritm (stereo) work under this framework (reconstruction in 1 shot)
          • similar to color imaging (several spectrum 1 in shot)
          • potential for different tiles (4x4 -> recover 16 frames)

Points

  • What metrics used in demosaicking papers ?
    • color
      • MSE/PSNR: error between reference and reconstructed images
      • S-CIELAB: perceptual color fidelity (http://scarlet.stanford.edu/~brian/scielab/introduction.html)
    • artifacts
      • a measure of zipper effect near sharp edges(https://pdfs.semanticscholar.org/9309/59339e8b69b90d18b479dbfa06049f4a5182.pdf)
      • a measure of aliasing (false color)
  • How confident are we in adapting bayer CFA interpolation methods to structured light images ?
    • assumptions that cannot be assumed
      • constant/smooth hue assumption (spectrally, hue is constant/smooth inside boundaries of objects) since the image is grayscale
      • each pixel has
    • assumptions that can be exploited
      • homogeneity assumption (spatially, neighboring pixels morel likely to have similar colors)
    • some demosaicing algorithm exploit structure of bayer color filter array, which is not the case in our case.
      • sequential demosaicing methods (interpolate G channel first, then interpolate R,G) relies on the rationale tht since G channel is sampled more it is less aliased. This is not true for the two bucket camera
  • http://people.duke.edu/~sf59/TIP_Demos_Final_Color.pdf
    • considers super-resolution and demosaicing as the same problem

hyperspectral imaging

  • https://link.springer.com/article/10.1007/s00138-018-0965-4
    • hypterspectral imaging mosaic design
    • discusses
      • how much hyperspectral imaging benefit from spectral and spatial correlation
  • https://link.springer.com/article/10.1007/s11042-018-6396-4
    • http://sesar.di.unimi.it/jdemdb/
    • a new dataset (16-bit images) for benchmarking demosaicing and denoising algorithms
  • Generating Training Data for Denoising Real RGB Images via Camera Pipeline Simulation
    • https://arxiv.org/pdf/1904.08825.pdf
    • more realistic dataset generation with realistic noise characteristics

The list of state of art methods

05.30

  • tabulate the psnrs in a matrix …
    • only >= 2015 papers
  • things to keep track of
    • brief description of methods
      • pro/con
      • interest
    • feasibilty/bottleneck w.r.t. c2b camera
      • easy to use ?
  • some observation on datasets
    • deep learning based ones could be trained with several hundred to 2k images
    • however, this is a problem for c2b cameras, since the dataset would be fixed to particular tiling, and a particular reconstruction pattern
    • just need full-res images ….
    • different pattern for each frame

05.31

  • mosaic design could be really cool as a second step
    • Deep Joint Design of Color Filter Arrays and Demosaicing
      • automatic design of filter arrangement with an autoencoder
  • hook up or implement method myself ?
    • if its easiest to do this …
  • adapt the neural net for two bucket camera
    • i.e. input is two bucket camera
    • can do demosaic and demultiplexing at same time…
  • decide which one is most amenable …
  • pick a few that is reasonable, pros and cons, what does it take to get there
    • write the email
    • propose a few
  • short: 2014_flexISP full: FlexISP: A Flexible Camera Image Processing Framework url: http://www.cs.ubc.ca/labs/imager/tr/2014/FlexISP/FlexISP_Heide2014_lowres.pdf notes: - end-to-end image processing that enforce image priors as proximal operators and use ADMM/primal-dual for optimization - end-to-end reduces error introduced in each steps of image processing, as each stage is not independent - applied to demosaicing, denoising, deconvolution, and a variety of reconstruction tasks evaluate: - recontruction based, no need for large datasets - classical regularizers that have proven to work well - choice of prior is shown to influence performance, so the choice of prior is important but ad hoc - no principled ways to pick solver parameters, i.e. the weight of regularizers - nonconvexity of the regularizers makes the optimization not guaranteed to converge to global optimum