[WIP] Building an optimized and accurate Jpeg Decoder [deprecated] #298

antonfirsov · 2017-08-19T00:51:47Z

Update

Anyone seeing this: The information here no longer up-to date, because we discovered PDF.js being inaccurate on progressive Jpegs. Gonna open an issue soon.

Remarks

At the moment of opening this PR, it's way too far ahead of our current master. We need to merge #275, qa-lab, and jpeg-port to wipe-out unrelated changes. Edit: Even the first comments below the PR are old comments on commits.

We should be able to merge a stable state of the new decoder into master before introducing the biggest breaking changes. In order to achieve this, we should branch jpeg-lab into a separate PR (maybe by reusing #274).

If you wish to contribute developing .NET-s first optimized managed-only open source Jpeg decoder, any help is welcome, both from new and old contributors!

Description

Porting PDF.js with #274 did a good job by introducing a cleaner stream parsing logic, capable for unified, and low-memory management of Baseline and Progressive jpeg-s. However, we will not able to speed up the decoder with integer arithmetics and lookup tables. We need to refactor/replace most of it's calculations with floating point SIMD arithmetics.

Design goals

low-CPU, low-memory, accurate - these trade-offs are natural enemies.
Make it modular and configurable. If we can't win on all fronts in the previous point, we can provide choices for the user.
Unit/Integration test as much internals as possible

Non-goals

Enabling asynchronous scan-by-scan decoding of progressive jpeg-s is not an architectural goal at this moment. We can introduce it in a future version.
More generally: asynchronicity is not a goal at this point.

Architecture

In the first step, the decoder reads all blocks from all scans, storing them in a compact form in spectral space (FrameComponent.BlockData), even for progressive jpegs. The implementation is basically done by porting PDF.js.

In the second step we should process the spectral blocks MCU-row by MCU-row doing the following steps:

Block8x8 -> Block8x8F
IDCT + Quantization on Block8x8-s
SIMD colorspace conversion on float buffers
Pack RBA float buffers into TPixel-s and copy the result into Image<TPixel>
Repeat 1.-4. for each MCU row

Implementation Plan [WIP]

General:

Structs and ref-s are overused (my fault), we can use classes for small objects having only a few instances without performance loss.
Introduce Block8x8 struct for integer blocks.
Split JpegDecoderCore into 3 classes:
- SpectralJpegData: representing Huffmann-decoded, unzigged Block8x8-s for decoded components.
- JpegStreamDecoder: produces a SpectralJpegData by consuming a stream.
Introduce integration test for JpegStreamDecoder: we can convert SpectralJpegData into spectral-space Image<Rgba32>, and test it against spectral-space reference images. We should expect exact equality at this point.
Speed up and the Huffman decoder.

Switch-to-float

Implement tests for the whole spectral space -> RGB conversion chain. We should define expected results per individual block.
Find and implement the most accurate floating-point IDCT algorithm that could be implemented with .NET SIMD capabilities. ImageSharp.Formats.Jpeg.GolangPort.Components.DCT seems to be inaccurate.
Quantization
SIMD colorspace-conversion.
Pack Vector4-s into TPixel-s

…Sharp into jpeg-lab

… (Optimizing PNG-s with external tools from now.)

# Conflicts: # tests/Images/External

# Conflicts: # tests/ImageSharp.Tests/TestFile.cs

JimBobSquarePants · 2017-08-19T07:08:49Z

@antonfirsov I'm closing this one for now so we can create a clean one based on the beta-1 codebase without any additional commits.

JimBobSquarePants and others added 30 commits June 16, 2017 22:56

Begin port

1555f09

Fix header finder

1abe631

Add js source link

d4d74b4

Use buffer

f63f85a

Remove offset

2c629c7

Fix progressive bool assignment

b025ed6

(╯°□°）╯︵ ┻━┻

3728b82

Can now build huffman tables

0ea7a6f

Begin ProcessStartOfScan

a718bf8

Merge branch 'master' into jpeg-port

cd72206

Can now decode a scan

1629819

Begin second phase of decoding

8bbc63f

Impove disposal

c1025a6

Experiment with new file marker finder

549e61f

Merge branch 'master' into jpeg-port

ba8a5b3

Decoder now doesn't break tests

2f501eb

Fix progressive decoding

4a4e94d

baseline decode works progressive nearly

472d6ba

Fix progressive scan decoding

28a8aca

Can now decode many images

ca9bd35

Merge branch 'master' into jpeg-port

69c15e3

Can now decode that bad progressive image

59c0793

Now decodes all images

827ca83

Fix #159

e2d26eb

use an offset span instead of buffer

76e91db

additional usages of Span

db2b712

fixed Sandbox46 execution

5439240

Rough working better Huffman

ea0abc9

Better Huffman decoding

0f60242

Almost got Huffman LUT working

0323d00

JimBobSquarePants and others added 21 commits August 17, 2017 09:49

Use more accurate IDCT

f5b9a8c

.

9da4345

Merge branch 'jpeg-lab' of https://github.com/JimBobSquarePants/Image…

2d80a6f

…Sharp into jpeg-lab

good by GenericFactory!

84852a8

Using Corecompat.System.Drawing as reference encoder/decoder for PNG.…

1562e32

… (Optimizing PNG-s with external tools from now.)

PngDecoder is covered now, and proven to be buggy :P

02eb5f2

covered DetectEdges

1df0010

Merge remote-tracking branch 'origin/antonfirsov/qa-lab' into jpeg-lab

f6904d9

# Conflicts: # tests/Images/External

TestImageProvider.FileProvider cache is now aware of decoder parameters

a103cb8

Merge remote-tracking branch 'origin/antonfirsov/qa-lab' into jpeg-lab

a798d5a

provider.GetImage(new JpegDecoder())

385ed88

let's merge jpeg-port to have the changelog!

c4953b0

Merge remote-tracking branch 'origin/jpeg-port' into jpeg-lab

1959c4a

# Conflicts: # tests/ImageSharp.Tests/TestFile.cs

grouping files for decoders

7383124

moving a few more files

4676d8a

GolangPort namespaces following folder structure

b6d4f35

move Block8x8F into ImageSharp.Formats.Jpeg.Common

1c75403

adjust PdfJsPort namespaces

80380d9

prefixing GolangPort stuff with Old*** #Round1

51b430b

renaming is hard

493deda

introduced OldJpegDecoder : IImageDecoder for the GolangPort decoder

b3b4827

antonfirsov mentioned this pull request Aug 19, 2017

Improve Jpeg Decoder #192

Closed

antonfirsov added formats:jpeg help needed up-for-grabs labels Aug 19, 2017

JimBobSquarePants changed the base branch from master to beta-1 August 19, 2017 06:58

JimBobSquarePants merged commit b3b4827 into beta-1 Aug 19, 2017

JimBobSquarePants mentioned this pull request Aug 19, 2017

Beta 1 #299

Merged

4 tasks

antonfirsov changed the title ~~[WIP] Building an optimized and accurate Jpeg Decoder~~ [WIP] Building an optimized and accurate Jpeg Decoder [deprecated] Aug 21, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Building an optimized and accurate Jpeg Decoder [deprecated] #298

[WIP] Building an optimized and accurate Jpeg Decoder [deprecated] #298

antonfirsov commented Aug 19, 2017 •

edited

Loading

JimBobSquarePants commented Aug 19, 2017

[WIP] Building an optimized and accurate Jpeg Decoder [deprecated] #298

[WIP] Building an optimized and accurate Jpeg Decoder [deprecated] #298

Conversation

antonfirsov commented Aug 19, 2017 • edited Loading

Update

Remarks

Description

Design goals

Non-goals

Architecture

Implementation Plan [WIP]

General:

Switch-to-float

JimBobSquarePants commented Aug 19, 2017

antonfirsov commented Aug 19, 2017 •

edited

Loading