releasetools: Reduce memory footprint for BBOTA generation.

The major issue with the existing implementation is that it
unnecessarily holds too much data in memory. For example, HashBlocks()
first reads *all* the data into a list before hashing. We can leverage
generator functions to stream such operations instead.
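
For illustration, a minimal sketch of the two hashing patterns
(hypothetical helper names, not the actual releasetools code):

  from hashlib import sha1

  def hash_all_at_once(chunks):
    # Old pattern: buffer *all* the chunks before hashing, so peak
    # memory grows with the total amount of data.
    data = b"".join(chunks)
    return sha1(data).hexdigest()

  def hash_streaming(chunks):
    # New pattern: feed each chunk into the hash as the generator
    # produces it, so peak memory stays at roughly one chunk.
    h = sha1()
    for data in chunks:
      h.update(data)
    return h.hexdigest()
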
This CL makes the following changes to reduce the peak memory use.
- Adding RangeSha1() and WriteRangeDataToFd() to Image classes. These
  functions hash and write the range data on the fly.
- Caching the computed SHA-1 values for a Transfer instance (see the
  sketch after this list).
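
A minimal sketch of the per-Transfer SHA-1 caching (hypothetical
attribute and method names; the real Transfer class lives in
blockimgdiff.py):

  class Transfer(object):
    def __init__(self, src_ranges, src_image):
      self.src_ranges = src_ranges
      self.src_image = src_image
      self._src_sha1 = None

    def src_sha1(self):
      # Compute the source SHA-1 once; later callers reuse the cached
      # value instead of re-reading and re-hashing the source blocks.
      if self._src_sha1 is None:
        self._src_sha1 = self.src_image.RangeSha1(self.src_ranges)
      return self._src_sha1
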
As a result, this CL reduces the peak memory use by ~80% (e.g. from
5.85GB to 1.16GB for the same incremental, as shown by the "Maximum
resident set size" reported by `/usr/bin/time -v`). It also improves
the package generation performance by ~30%.

Bug: 35768998
Bug: 32312123
Test: Generating the same incremental w/ and w/o the CL gives
      identical output packages.
Change-Id: Ia5c6314b41da73dd6fe1dbe2ca81bbd89b517cec

diff --git a/tools/releasetools/sparse_img.py b/tools/releasetools/sparse_img.py
index 4ba7560..7eb60d9 100644
--- a/tools/releasetools/sparse_img.py
+++ b/tools/releasetools/sparse_img.py
@@ -144,6 +144,12 @@
     f.seek(16, os.SEEK_SET)
     f.write(struct.pack("<2I", self.total_blocks, self.total_chunks))
 
+  def RangeSha1(self, ranges):
+    h = sha1()
+    for data in self._GetRangeData(ranges):
+      h.update(data)
+    return h.hexdigest()
+
   def ReadRangeSet(self, ranges):
     return [d for d in self._GetRangeData(ranges)]
 
@@ -155,10 +161,11 @@
     ranges = self.care_map
     if not include_clobbered_blocks:
       ranges = ranges.subtract(self.clobbered_blocks)
-    h = sha1()
-    for d in self._GetRangeData(ranges):
-      h.update(d)
-    return h.hexdigest()
+    return self.RangeSha1(ranges)
+
+  def WriteRangeDataToFd(self, ranges, fd):
+    for data in self._GetRangeData(ranges):
+      fd.write(data)
 
   def _GetRangeData(self, ranges):
     """Generator that produces all the image data in 'ranges'. The