本帖最后由 8899 于 2024-9-20 10:25 编辑
https://github.com/sz3/cimbar 二维码里面可以藏30m文件 |
早上刷到这个抖音了 |
cimbar is a proof-of-concept 2D data encoding format — much like QR Codes, JAB codes, and Microsoft’s HCCB.
[A non-animated mode-B cimbar code] Cimbar encodes data in a grid of symbols (or icons). There are 16 possible symbols per tile (position) on the grid, encoding 4 bits per tile. In addition, 2-3 color bits can be encoded per position on the grid, meaning up to 7 total bits per tile. 4 bit cimbar encoding There are multiple color schemes: "dark" mode — meant for backlit computer screens, with bright tiles on a black background Cimbar was inspired by image hashing. On a per-tile basis, the cimbar decoder compares the tile against a dictionary of 16 expected tiles — each of which maps to a 4 bit pattern — and chooses the one with the closest imagehash, as measured by hamming distance. Similarly, the decoder will compare the average color of a tile to a dictionary of (ex: 4) expected colors, and choose the one with the closest color. Error correction is applied on the resulting bit stream, and the bit stream itself is interleaved across the image — that is, adjacent tiles do not contain adjacent data — to tolerate localized errors in a source image. Yes, and I can prove it. encoder: https://cimbar.org (uses libcimbar) The main constraints cimbar must deal with are: all tiles in the tileset must be sufficient hamming distance away from each other, where sufficient is determined by whether the decoder can consistently place blurry or otherwise imperfect tiles in the correct "bucket". Cimbar is designed to deal with some lossiness. In practice, the source image should be around 700×700 resolution or greater, in focus, and with some color correction handled by the camera — as you’ll hopefully find in any modern cell phone. This python cimbar implementation is a research project. It works, but it is slow, and does not handle error cases with much grace. libcimbar, the C++ implementation, has been much more heavily optimized and tested. The target goals of the proof-of-concept were: achieve data density on the order of 10kb per image. I want numbers! a mode B (8×8, 4-color, 30/155 ecc, 6-bits-per-tile) cimbar image contains 9300 raw bytes of data, and 7500 bytes with the default error correction level (30) I want to try it! Encoding: python -m cimbar.cimbar –encode myinputfile.txt encoded.png Decoding: python -m cimbar.cimbar encoded.png myoutputfile.txt There are also some utility scripts, such as the one to measure bit errors: python -m cimbar.cimbar encoded.png -o clean.txt –deskew=0 –ecc=0 |
|