Logo

Programming-Idioms

  • C++
  • Lisp
  • Python
  • Fortran

Idiom #275 Binary digits to byte array

From the string s consisting of 8n binary digit characters ('0' or '1'), build the equivalent array a of n bytes.
Each chunk of 8 binary digits (2 possible values per digit) is decoded into one byte (256 possible values).

  subroutine to_s (s, a)
    use iso_fortran_env, only: int8
    character (len=*), intent(in) :: s
    integer (kind=int8), allocatable, intent(out), dimension(:) :: a
    allocate (a(len(s)/8))
    read (unit=s,fmt='(*(B8.8))') a
  end subroutine to_s

int8 is an eight-bit integer. This uses Fortran's internal I/O with bit format of eight bits to read the array from the string.
#include <string>
#include <vector>
using namespace std;
const size_t n = s.length() / 8;

vector<uint8_t> a(n);

for(size_t block = 0; block < n; block++)
{
    uint8_t acc = 0;
    const size_t start = block * 8;
    for(size_t offset = start; offset < start + 8; offset++)
    {
        acc = (acc << 1) + (s[offset] - '0');
    }

    a[block] = acc;
}
f = lambda x: int(s[x:x + 8], 2)
a = [*map(f, range(0, len(s), 8))]
from re import findall
p = findall('.{8}', s)
a = bytes(int(x, 2) for x in p)
n = (len(s) - 1) // 8 + 1
a = bytearray(n)
for i in range(n):
    b = int(s[i * 8:(i + 1) * 8], 2)
    a[i] = b
import re
p = re.findall('.{8}', s)
a = [*map(lambda x: int(x, 2), p)]
from itertools import batched
f = lambda x: int(''.join(x), 2)
a = [*map(f, batched(s, 8))]

The `batched` function is from Python 3.12+.
#include <stdlib.h>
#include <string.h>
unsigned char *a = calloc(strlen(s) / 8, 1);
for(int i = 0; i < strlen(s); ++i){
  a[i / 8] |= (s[i] == '1') << (7 - i % 8);
}

If s is not 8n characters long or contains characters orher than '0' and '1', correct behavior is not guaranteed.

New implementation...
< >
programming-idioms.org