Logo

Programming-Idioms

  • Perl

Idiom #286 Iterate over characters of a string

Print a line "Char i is c" for each character c of the string s, where i is the character index of c in s (not the byte index).

Make sure that multi-byte characters are properly handled, and count for a single character.

use v5.10;
use utf8;
binmode STDOUT, ":utf8"; 

while ($s =~ /(\X)/g) {
    say 'Char ' . pos($s) . ' is ' . $1;
}

Import pragma 'use utf8' is necessary if string $s is set from a string inside your souce code. If read from stdin, you'll need binmode STDIN, ':utf8' as well. The while loop does a pattern match looking for any unicode character (\X) and repeats that multiple times (the g modifier). The pos() function returns the position of the match. Parens are used around the \X to capture the character and return it in $1. The . operator is for string concatenation.
(dorun (map-indexed (fn [i c] (println "Char" i "is" c)) s))

In Clojure a string is a list of characters

New implementation...