Flutter is Google’s portable UI framework based on Dart language. It allows the deployment of web apps and native apps on iOS, Android, Windows, Mac, Linux and others using mostly the same source code.
KaiOS
KaiOS is an operating system for feature phones with a keypad (no touchscreen) as the Nokia 8110 4G or the Accent Nubia 50k for example. Such phones have long-lasting batteries, are cheap, robust and simple to use. This explains why KaiOS phones had a global market share of 0.69% in 2019 and reached 3.68% within India, beating iOS.
While not iOS nor Android based, KaiOS also has an applications store, the KaiStore. The applications that can be deployed there are web applications written in HTML5.
The question
As Flutter is very portable and as there are millions of KaiOS phone users, is there a way to make a Flutter app compatible with KaiOS? This would allow the use of the same source code for an additional platform.
The approach
The approach used to answer the question consisted in creating a proof of concept application in the form of a minimalist chat Flutter application: chattack
This application is also available as a web application on https://chat.jod.li
Once you create a user, you can join the following shared chat and see whether someone answers:
Chat name: zorro
Chat shared key: iamzorro
The implementation of a chat application involves communication between several phones and proves that many applications are possible.
The battles
This proof of concept application came with some struggles.
Replace the icons font file by a reduced custom one (some 800 KB gained with this).
Websockets
Websockets are used by the app to receive messages from the server. The package used for websockets within a web Flutter app is not the same as the one for native apps but the functioning is the same.
While websockets were working fine on my Linux desktop, KaiOS phone, Android phone and Opera on Mac, they were not working on iOS and Safari on Mac. As explained in a websocketd GitHub issue, this was due to empty headers used in the command line to start the websocket script.
Keypad instead of touchscreen
Keypad interactions are at the core of the app UX because there is no touchscreen on KaiOS phones. Flutter was not meant to deal with this. Coping with the absence of touchscreen was far more difficult than anticipated. This involved:
Setting a focus text and a focus action for each interactive element of the UI.
Capturing the keypad taps.
Filtering out the false double taps in particular on the enter key.
Mapping the tapped keys with user interactions.
Traveling between focusable elements whenever relevant depending on the actions. This was not as easy as it seems because the FocusNode.nextFocus() method doesn’t always lead to the next focusable element.
Scrolling some elements depending on the context and the tapped keys.
Displaying the bottom context bar which shows the behavior of the top keypad buttons.
Those adaptations reminded me of those involved in the preparation of a website or app for accessibility for the visually impaired. As such, developing an app for keypad based interactions could be an excellent way to focus the design and development efforts on those important accessibility aspects.
Note that typing long text on a keypad is far from being fast and comfortable for most users. To cope with this, KaiOS comes with Google assistant based dictation which works reliably on Flutter text fields as shown in this video.
Performance
The Flutter web app release comes out built minified. Even then, the app interaction is very slow on the KaiOS device (in my case an Accent Nubia 50k) while it is very acceptable on desktop Opera for example.
Here is a comparison of the time it takes for the same interactions on the same network:
Action
Desktop Opera time
KaiOS app time
Focus next element with arrows
Instantaneous
Almost a second
Login and load chat with 80 messages
<2s
>4s
Load register screen with captcha image
<2s
>4s
Open rules page
Instantaneous
Instantaneous
Some execution times are much shorter for Opera than for KaiOS. This doesn’t seem like a good indication for potential optimization. However, the last line shows that some interactions can happen instantaneously on a KaiOS app.
Conclusion
Knowing whether Flutter can be used to develop applications for KaiOS is important because this could increase even more the portability of Flutter applications.
Flutter can be used on KaiOS under the following conditions:
The application has an acceptable performance because of the following points:
The population targeted is more interested in features than performance.
The app is designed avoiding the interactions that are slow on KaiOS with Flutter (as it was the case for displaying the rules in the proof of concept chat app).
KaiOS is improved to make the execution of the application faster either by accepting native applications or by improving the web app execution engine.
KaiOS accepts the app in the KaiStore. This has not been clarified yet.
Here is a list of application examples that could potentially already be developed with Flutter, run on KaiOS and also be deployed on other platforms:
Elderly people acquaintances emergency alert.
White noise app for baby sleep.
Exam practice quiz.
The preparation of a Flutter app for keypad interactions could be done together with its prepration for accessibility for the visually impaired.
Further reading
A very interesting, well written and more up to date article was made available on KaiOS.dev on the topic of Flutter development for KaiOS on February 2024.
Future
Here is a list of potential future changes on the topics discussed above:
Flutter plugin to automatically reduce the size of a Flutter web app using the tricks mentioned in the “Project size” section.
Flutter plugin to deal with keypad interaction instead of touch screen.
Try to deploy a Flutter app to the KaiStore to know whether this can be accepted. This was attempted on 2021/03/16 with the status “Testing” as of 2021/05/01.
Disclaimer
I didn’t receive any money or incentive of any kind from anyone to write this article. I don’t work for any of the companies involved in the products or projects mentioned.
ASCII art in PHP relies on the selection of a set of characters for the graphic representation. We didn’t find an optimal selection of characters mentioned anywhere. The example character sets seem to be chosen arbitrarily.
Approach
We consider as an acceptable approximation that the number of pixels used to display a font can be used to chose how dark/bright a character is in the ASCII art representation. With this, we use the 5 standard fonts within PHP to count the pixels for each ASCII character which decimal code is within (32-126). The result is the following for example for PHP font 5:
With no surprise, the minimum number of pixels is 0 for the space.
We divide the pixels space by the desired number of characters to find the best spread characters. We do this until we reach a situation where there is no character repetition. This occurs for for all fonts with sets of 9 characters.
PHP code
The following PHP code allowed to reach the desired outcome:
<?php
if(isset($argv[1]) && strlen($argv[1] && $argv[1]>1)) {
$nbChars = $argv[1];
}else{
echo "Please Specify a number of characters > 1\n";
exit(1);
}
//$font must be 1-5
function nbPixelsForCharFont($char,$font){
//inspired by https://www.php.net/manual/en/function.imagestring.php
$h=15;
$w=15;
// Create a sized image
$im = imagecreate($h, $w);
// White background and black text
$bg = imagecolorallocate($im, 255, 255, 255);
$textcolor = imagecolorallocate($im, 0, 0, 0);
// Write the string at the top left
imagestring($im, $font, 0, 0, $char, $textcolor);
$colorsSum=0;
for($y=0;$y<$h;$y++){
for($x=0;$x<$w;$x++){
$color=imagecolorat ( $im , $x , $y );
$colorsSum+=$color;
}
}
imagedestroy($im);
return($colorsSum);
}
function buildCharIntForNbPxArray($font){
$charIntForNbPx=array();
for($i=32;$i<127;$i++){
$c=chr($i);
$nbPx=nbPixelsForCharFont($c,$font);
$charIntForNbPx[$nbPx]=$i;
}
ksort($charIntForNbPx);
return($charIntForNbPx);
}
function findClosestKey($arr,$desiredKey,$reach=0){
$r=(int)round($desiredKey);
$high=$r+$reach;
$last=array_key_last($arr);
if($last>=$high&&array_key_exists($high,$arr))
return($high);
$low=$r-$reach;
$first=array_key_first($arr);
if($first<=$low&&array_key_exists($low,$arr))
return($low);
if($last>=$high||$first<=$low)
return(findClosestKey($arr,$r,$reach+1));
return(null);
}
function findBestChars($font,$nbChars){
$bestChars=array();
$charIntForNbPx=buildCharIntForNbPxArray($font);
$minPx=array_key_first($charIntForNbPx);
$maxPx=array_key_last($charIntForNbPx);
$idealStep=($maxPx-$minPx)/($nbChars-1);
for($i=$minPx;$i<=$maxPx;$i+=$idealStep){
$key=findClosestKey($charIntForNbPx,$i);
if(is_null($key))
echo("Error, char shouldn't be null!\n");
$bestChars[]=$charIntForNbPx[$key];
}
return($bestChars);
}
echo("Best chars for ascii art per PHP font for ".$nbChars." characters:\n");
for($font=1;$font<6;$font++){
$prev=null;
echo("font ".$font." best chars:\n");
foreach(findBestChars($font,$nbChars) as $char){
echo($char.":".chr($char)."\n");
if($prev==$char)
echo("ALERT!!! same as previous!!!\n");
$prev=$char;
}
echo("\n");
}
?>
Outcome
The outcome is the following table of optimized character sets:
Font 1
Font 2
Font 3
Font 4
Font 5
32:
32:
32:
32:
32:
94:^
96:`
46:.
46:.
96:`
95:_
34:”
96:`
58::
94:^
120:x
59:;
33:!
114:r
124:|
125:}
118:v
63:?
108:l
49:1
71:G
122:z
120:x
70:F
97:a
81:Q
121:y
121:y
109:m
85:U
64:@
72:H
87:W
82:R
66:B
35:#
87:W
78:N
81:Q
78:N
Comparison
To compare the sets, we used the following PHP code inspired by this page:
Example setFont 1 setFont 2 setFont 3 setFont 4 setFont 5 set
Our favorite is the font 5 set.
Adaptations
As the ASCII art representation depends on the selected font, the above explained approach can be adapted to the situation. Also, the number of characters can easily be either increased or decreased.
At jod.li, we started developing Flutter apps in June 2019 (9 months ago). We needed a cross-platform mobile development solution. We chose Flutter because it is evolving fast, yet, it is stable enough for robust development. There is a large community. Also, with Google as the underlying force behind, things are less likely to go wrong. Finally, we tried several other solutions without being convinced and sometimes even being very disappointed (several months of development on a non-Flutter solution and after an update, we ended up with an unrecoverable amount and complexity of compilation errors).
So, Flutter seemed like a good choice.
An example app
We have grown enough knowledge in Flutter to develop side projects in a few days as we need it. During the covid-19 pandemic, the cryptocurrencies are going crazy. Thus, not having a notification on Binance when a price reaches a certain threshold appeared like a problem easy to fix with a Flutter app.
We have thus developed an app that does just that relying on the Binance API.
App size
Came the time of publication on the Google Play Store. The Android App ends up with a very reasonable size smaller than 7 MB.
However, the iOS app once installed on an iPhone takes 407.1 MB. This is 5 times the 84.2 MB the Google Chrome app takes on the same iPhone. This is of course not reasonable.
This is a time when we realise that we may have taken – again – the wrong road for several months with Flutter. All of this because of this final app size on iOS.
It says that the embedded Flutter framework file (the vast majority of the bytes in the app) is going to be reduced in the app before reaching the user mobile devices. Well, we’d like to see this happening before we pay the 100 bucks to Apple for the publication.
Preparing the app for the store
So to get closer to the store reality, we follow the iOS Flutter publication guidelines:
The archive size is even bigger than the debug app size on the iPhone: 577.4 MB
Let’s go further anyway.
Deploy to Apple app store
We pay the 100 bucks for a deployment in the Apple app store.
Few hours later, we receive the confirmation email and create the app in the app store.
Small trick, the store icon cannot contain any alpha layer both in the app store and in the app archive (1024×1024 px).
Back in Xcode, at the archive validation step, there is indeed a mention that the bitcode will not be included and that Swift symbols can be striped.
Since the archive is extremely big, the uploading takes ages with an annoying forever 100% progress bar.
Permission and other issues
After all this time uploading, the store sends a rejection email with the following permission related issues:
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSContactsUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSCalendarsUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSAppleMusicUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSMotionUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSSpeechRecognitionUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
Though you are not required to fix the following issues, we wanted to make you aware of them:
ITMS-90078: Missing Push Notification Entitlement - Your app appears to register with the Apple Push Notification service, but the app signature's entitlements do not include the "aps-environment" entitlement. If your app uses the Apple Push Notification service, make sure your App ID is enabled for Push Notification in the Provisioning Portal, and resubmit after signing your app with a Distribution provisioning profile that includes the "aps-environment" entitlement. Xcode does not automatically copy the aps-environment entitlement from provisioning profiles at build time. This behavior is intentional. To use this entitlement, either enable Push Notifications in the project editor's Capabilities pane, or manually add the entitlement to your entitlements file. For more information, see https://developer.apple.com/library/content/documentation/NetworkingInternet/Conceptual/RemoteNotificationsPG/HandlingRemoteNotifications.html#//apple_ref/doc/uid/TP40008194-CH6-SW1.
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSLocationAlwaysUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
ITMS-90683: Missing Purpose String in Info.plist - Your app's code references one or more APIs that access sensitive user data. The app's Info.plist file should contain a NSLocationWhenInUseUsageDescription key with a user-facing purpose string explaining clearly and completely why your app needs the data. Starting Spring 2019, all apps submitted to the App Store that access user data are required to include a purpose string. If you're using external libraries or SDKs, they may reference APIs that require a purpose string. While your app might not use these APIs, a purpose string is still required. You can contact the developer of the library or SDK and request they release a version of their code that doesn't contain the APIs. Learn more (https://developer.apple.com/documentation/uikit/core_app/protecting_the_user_s_privacy).
To summarise, The app requires permissions that are not needed for its execution. For Apple not to reject the build, we need to mention those permissions as not needed in the Podfile:
post_install do |installer|
installer.pods_project.targets.each do |target|
target.build_configurations.each do |config|
# Here are some configurations automatically generated by flutter
# You can remove unused permissions here
# for more infomation: https://github.com/BaseflowIT/flutter-permission-handler/blob/develop/ios/Classes/PermissionHandlerEnums.h
# e.g. when you don't need camera permission, just add 'PERMISSION_CAMERA=0'
config.build_settings['GCC_PREPROCESSOR_DEFINITIONS'] ||= [
'$(inherited)',
## dart: PermissionGroup.calendar
'PERMISSION_EVENTS=0',
## dart: PermissionGroup.reminders
'PERMISSION_REMINDERS=0',
## dart: PermissionGroup.contacts
'PERMISSION_CONTACTS=0',
## dart: PermissionGroup.camera
'PERMISSION_CAMERA=0',
## dart: PermissionGroup.microphone
'PERMISSION_MICROPHONE=0',
## dart: PermissionGroup.speech
'PERMISSION_SPEECH_RECOGNIZER=0',
## dart: PermissionGroup.photos
'PERMISSION_PHOTOS=0',
## dart: [PermissionGroup.location, PermissionGroup.locationAlways, PermissionGroup.locationWhenInUse]
'PERMISSION_LOCATION=0',
## dart: PermissionGroup.notification
'PERMISSION_NOTIFICATIONS=0',
## dart: PermissionGroup.mediaLibrary
'PERMISSION_MEDIA_LIBRARY=0',
## dart: PermissionGroup.sensors
'PERMISSION_SENSORS=0'
]
end
end
end
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
<key>aps-environment</key>
<string>Write here your reason for using push notifications.</string>
</dict>
</plist>
Invalid Bundle. iPad Multitasking support requires these orientations: 'UIInterfaceOrientationPortrait,UIInterfaceOrientationPortraitUpsideDown,UIInterfaceOrientationLandscapeLeft,UIInterfaceOrientationLandscapeRight'. Found 'UIInterfaceOrientationPortrait' in bundle ...
We also got the build number error although no app was actually successfully published yet (we just increase the build number to 1.0.1):
ERROR ITMS-90189: "Redundant Binary Upload. You've already uploaded a build with build number '1.0.0' for version number '1.0.0'. Make sure you increment the build string before you upload your app to App Store Connect. Learn more in Xcode Help (http://help.apple.com/xcode/mac/current/#/devba7f53ad4)."
Nearly 2 hours of upload later, finally some good news:
We complete the submission on the Apple App Store Connect site.
Publication
6 hours later, the app is in review and 30 minutes later, the app is ready for sale.
Few hours later, the app is published on the app store and the resulting app size is a very reassuring 25.2 MB.
Conclusion
We should not worry too much about the app size before it is published. Once published, the app size is reduced considerably (archive 577 MB -> iPhone from app store 25 MB).
This first Flutter iOS Apple App Store Connect publishing made us aware of the following traps:
App store icon cannot contain transparency layer (1024×1024).
The archive is so big that the publication takes a very long time. Thus, automating this step is highly desirable.
When using the permission handler, include all the non-needed permissions in the Podfile.
With Flutter, always include a ios/Runner/Runner.entitlements file for the push notification permissions even if not used.
If not all orientations are supported on iPad, disable multitasking by supporting only fullscreen.
Always increase build number even if no application version was published.
The purpose of those experiments was to determine whether an interlaced PNG image could be smaller in size than the equivalent non-interlaced PNG image.
It was already explained here why interlaced PNG tend to be larger in size than the non-interlaced PNG equivalent image. Here was the conclusion:
“The conclusion is that the uncompressed data length of identical PNG images (with pixel size >= 8) will almost always be greater in the interlaced case because more filter types must be described in the file. Though, the compression (deflate) applied to the PNG data can lead to some particular cases if we compare the resulting PNG file sizes.”
So, the purpose here is to find those particular cases.
For the experiments to be valid, here is what must be identical between the interlaced and the non-interlaced images (based on this StackOverflow response which is based on the PNG RFC2083):
Non-pixel encoding related content embedded in the file such as palette (in the case of color type =! 3) and non-critical chunks such as chromaticities, gamma, number of significant bits, default background color, histogram, transparency, physical pixel dimensions, time, text, compressed text.
Different pixel encoding related content such as bit depth, color type (and thus the use of palette or not with color type = 3), image size,… .
Pixels, compression level.
Method used
The method used consisted in evolving through the auto optimised distributed https://oga.jod.li genetic algorithm with niche mechanism with the chromosome defining the shape and pixels of an image. Creating the interlaced PNG and the non-interlaced versions with ImageMagick. Comparing the file sizes in bytes. The image depth chosen is 8. The chromosome fitness is defined by the non-interlaced file size minus the interlaced file size offset by 200 to avoid reaching 0.
The auto optimisation: true (more details on this here)
The mutation rate: 0.3 (initial value only as it is auto evolved)
The crossover rate: 0.2 (initial value only as it is auto evolved)
The elitism rate: 0.5 (initial value only as it is auto evolved)
The niche distance ratio (ndr): 10 (initial value only as it is auto evolved) (more details on niche mechanism here)
The niche power sigma (psi): 1 (initial value only as it is auto evolved) (more details on niche mechanism here)
The minimum length: 2
The maximum length: 3000
The encoding: xdigit (hexadecimal)
The envelope: CSV
The number of chromosomes per batch: 40
Experiments
General shape
General shape search with no constraint except the maximum size due to the maximum length of the chromosome (3000). This is the same experiment as the one mentioned in the oga.jod.li auto optimisation parameter page.
Here are the results:
General shape batch fitness plot on oga.jod.li
The above batch fitness plot shows that:
The 1 pixel height plateau with a maximum fitness of 197 lasts 27 batches (1080 chromosomes). Indeed, it has been noticed, during those experiments that the evolution starts with a local optima for the PNG images with a height of 1 that leads to a maximum fitness of 197 (meaning that the non-interlaced PNG file is larger than the interlace PNG file as the fitness is offset by 200).
The maximum fitness is 222 after 168 batches (6720 chromosomes).
The maximum fitness did not improve for the next 754 batches (30160 chromosomes). Though, under the right circumstances, it is possible to reach a fitness of 230 as shown below.
As expected and shown below, the genetic algorithm parameters evolved over time due to the auto parameter being enabled:
General shape batch parameters plot on oga.jod.li
Not pattern or trend seems to be readable from the above plot. To the human eye, the parameter choices seem random.
The PNG image corresponding to the best chromosome is the following one (width 1 x height 505, 2036 bytes interlaced, 2058 bytes non-interlaced):
The shape of the image seems to be important for the fitness as the best chromosome converts to a tall image. Thus, the next experiment will constraint the image shape to a one pixel width.
One pixel width
As the best shape for a smaller interlaced PNG image file size compared to the same image non-interlaced is a tall image from previous experiment, this experiment constraints the shape to a one pixel width image.
The files for this experiment can be found here: 1px .
The outcome of the experiment are the following ones:
One pixel width batch fitness plot on oga.jod.li
The above plot shows that:
No one pixel width case had a fitness lower than 180.
Maximum fitness is 230 reached after batch 105 (4200 chromosomes). This means that for the best chromosome, the interlace PNG file size is 30 bytes larger than the same image non-interlaced with the constraint on the maximum chromosome length (3000).
The beginning of the evolution shows a fitness around 200. Which means that the 1 pixel width images interlace PNG files tend to have the same size as their interlaced version.
As expected and shown below, the genetic algorithm parameters evolved over time due to the auto parameter being enabled:
One pixel width batch parameter plot on oga.jod.li
Not pattern or trend seems to be readable from the above plot. To the human eye, the parameter choices seem random.
The PNG image corresponding to the best chromosome is the following one (width 1 x height 996, 3757 bytes interlaced, 3787 bytes non-interlaced):
For the sake of observation, the above image has been split in chunks of 8 pixels high (size of the PNG Adam7 interlace pattern) put side by side and scaled up below:
No particular pattern is visible on the above image. The pixels chosen by the evolution seem to be random to the human eye.
Random one pixel width 996 pixels height
In order to know whether the fitness is due only to the shape or whether the pixels content matter, several PNG images have been generated using random pixels and the interlaced PNG size has been compared to the non-interlaced PNG size. The script is available here: random_1px_band_png.sh .
Here are the results:
Comparing 1×996 random pixels PNG sizes
One can see from the above screenshot the following corresponding fitnesses (offset 200):
199
208
208
200
197
199
189
209
189
191
197
This shows a good repartition around 200 but always below 230, which means that the fitness is not just a matter of image shape but also a matter of pixels content.
Why is the fitness higher for tall images?
As explained here, the main reason why the interlaced images are usually larger than the non-interlaced ones is because of the number of filters that need to be encoded. The number of filters for the non-interlaced case is the number of lines while for the interlaced case it can be calculated as follows:
This leads to a total number of filters encoding in the interlaced case of: 996
This is the same number for the non-interlaced case.
Therefore, the cost for filters encoding can be the same for 1 pixel width images and the pixel content can be rearranged by the interlacing passes so that the filtering and compression lead to a smaller size interlaced than non-interlaced.
Conclusion
To minimise the interlaced PNG image size compared to the non-interlaced PNG image size, the oga.jod.li genetic algorithm used a 1 pixel width tall image. One pixel width tall image is a reasonable way to maximise the size difference between non-interlaced and interlaced as the number of filter encoding can be the same while the number of pixels can be significant enough to lead to a rearrangement by interlacing passes so that the filtering and compression lead to a smaller size interlaced than non-interlaced. Once constrained to this shape, the evolution tends to choose the longest images. If chosen randomly, the pixels content of such image lead to a lower fitness than the best chromosome found through evolution. This means that both, the shape and the pixels content are important to have an interlaced PNG image file size smaller than the non-interlaced equivalent image.
Future work
Confirm those findings with another tool than ImageMagick as the filtering and compression choices depend on the PNG image generator.
I would like to know by advance the length of the uncompressed data within a PNG image whether it is encoded interlaced or not. Knowing and comparing the size of interlaced against non-interlaced PNG uncompressed (inflated) data can allow a wiser choice when encoding a PNG image and can also help while decoding. This can also explain why PNG images are usually greater when interlaced. All the following explanations are based on the PNG RFC. I focus on the cases when the pixel size is >= 8 bits.
Interlaced and non-interlaced: what changes in the PNG file?
Let’s assume 2 identical PNG images, one non-interlaced and one interlaced.
The differences between the files describing those images will be:
– In IHDR chunk: Interlace method (0 for non-interlaced, 1 (Adam7) for interlaced). This doesn’t change the file size by itself.
– In IDAT chunks: Compressed (deflated) scanlines starting with filter type containing the encoded image data. This usually changes the file size as explained below.
Uncompressed data size
The uncompressed (inflated) data length can be calculated as follows for the cases when the pixel size is >= 8 bits.
Non-interlaced case
uncompressed_data_length = height * width * nb_bits_per_pixel / 8 + height
Where nb_bits_per_pixel = nb_sample_per_pixel * bit_depth
Where nb_sample_per_pixel depends on the color_type:
color_type to nb_sample_per_pixel:
0 (grayscale) -> 1
2 (RGB) -> 3
3 (palette based) -> 1
4 (grayscale with alpha) -> 2
6 (RGB with alpha) -> 4
So, for example, if we assume a non-interlaced image with the following characteristics, we get an uncompressed data size of 1695885:
height: 677
width: 626
color_type: 6
=> nb_sample_per_pixel: 4
bit_depth: 8
=> nb_bits_per_pixel: 32
Note the “+ height” at the end of the formula giving the uncompressed_data_length. This is due to the fact that for each scanline, a leading filter type byte is necessary and there are as many scanlines as there are lines in a non-interlaced image.
So, for example, if we assume an interlaced image with the following characteristics, we get an uncompressed data size of 1696479:
height: 677
width: 626
color_type: 6
=> nb_sample_per_pixel: 4
bit_depth: 8
=> nb_bits_per_pixel: 32
nb_pass1_lines = 85
nb_pass2_lines = 85
nb_pass3_lines = 85
nb_pass4_lines = 170
nb_pass5_lines = 169
nb_pass6_lines = 339
nb_pass7_lines = 338
=> sum_lines_per_pass = 1271 (which is greater than height 677 for the non-interlaced case)
Note the “+ sum_lines_per_pass” at the end of the formula giving the uncompressed_data_length. This is because the number of filter type needs to be added. Each scanline has a leading filter type. Having passes for the interlaced case leads to a different number of filter types.
Comparison of uncompressed data length formula whether interlaced or not
Is the number of filter types for the interlaced case always going to be greater than the one for the non-interlaced case because of the passes?
It is easy to show that the number of lines for passes 1, 2, 4 and 6 contribute already for at least a number equivalent to height when width > 4. Indeed:
CEIL(height/8) + CEIL(height/8) + CEIL(height/4) + CEIL(height/2) >= height
The number of lines for passes 3, 5 and 7 can contribute to increase the number of filters for the interlaced case above the number of filters for the non-interlaced case.
Thus, the number of filters for the interlaced case will almost always be greater than the one for the non-interlaced case.
Why is this only “almost” always greater?
Let’s take the following particular example:
height: 1
width: 1
color_type: 6
=> nb_sample_per_pixel: 4
bit_depth: 8
=> nb_bits_per_pixel: 32
nb_pass1_lines = 1
nb_pass2_lines = 0
nb_pass3_lines = 0
nb_pass4_lines = 0
nb_pass5_lines = 0
nb_pass6_lines = 0
nb_pass7_lines = 0
=> sum_lines_per_pass = 1 (which is equal to height 1)
This means that the uncompressed data length should be the same for a 1 pixel PNG image whether interlaced or not (particular case).
Conclusion
The conclusion is that the uncompressed data length of identical PNG images (with pixel size >= 8) will almost always be greater in the interlaced case because more filter types must be described in the file. Though, the compression (deflate) applied to the PNG data can lead to some particular cases if we compare the resulting PNG file sizes.