From: Nick Alcock <nick.alcock@oracle.com>
Date: Fri, 9 Jan 2015 19:23:20 +0000 (+0000)
Subject: ctf: speed up dwarf2ctf by avoiding ctf_update() calls
X-Git-Tag: v4.1.12-92~313^2~21
X-Git-Url: https://www.infradead.org/git/?a=commitdiff_plain;h=fe5014ae74e00f2f21d2eaa87f7687b57a763539;p=users%2Fjedix%2Flinux-maple.git

ctf: speed up dwarf2ctf by avoiding ctf_update() calls

By far the slowest operation in dwarf2ctf is ctf_update().  This operation
serializes all CTF data then essentially reopens the CTF file using that data.
For large CTF files like the vmlinux and shared type repositories this is
extremely slow, yet we do it between every top-level type and variable and
sometimes even more often than that.  We are doing this for three reasons:

 - firstly, in case of error we want to be able to throw away the erroneous
   type and any new types we may have added that it depends on but nothing
   else does.  libdtrace-ctf's only API for this is ctf_discard(), which
   throws away all types added since the most recent ctf_update(): so we
   must call ctf_update() frequently just to give a point to discard to.

 - secondly, whenever a new structure is encountered we need to count the number
   of members in it to tell whether it is at least as large as any
   previously-encountered instance in scope in this CTF file or the shared one,
   and if it is, to note that the already-existing structure should be updated
   rather than creating a new one: we did this by iterating over the members in
   the DWARF and in the previously-encountered instance in the CTF.  That meant
   the CTF type had to be lookable-up... and you can't look up newly-added types
   in CTF until you've called ctf_update() on the CTF file containing those
   types.

 - thirdly, libdtrace-ctf internally looks up the type of structure members to
   determine their size (and thus the offset of the next element).

We avoid the first requirement by using the new ctf_snapshot()/ctf_rollback()
machinery.  ctf_snapshot() is radically faster than ctf_update(): it just grabs
a couple of integers, increments one, and returns them.  So we can usually avoid
calling ctf_update() between every emitted type, though we do then have to call
it for all the CTF files just before we emit them. (But that happens only once
for a given file.)

We avoid the second requirement by maintaining our own count of structure and
union members in the recently-added per_module hash: this count is itself a hash
mapping a variant of CTF structure names (an "s" or "u", then a space, then the
name) to a structure containing the CTF file that contains this structure
definition, its type ID, and a count of its members.  (We use this notation
because we want to know if a structure has the same C name as another structure,
even if its type ID differs because it was declared in a different place).  We
can then check this structure's count to see if a structure of this name already
exists, and return its CTF ID (recorded when the structure was emitted for the
first time).  The member-emission code, obviously, has to pull out this count
and increment it whenever it emits a new member (even a blacklisted one which is
otherwise ignored, since we are interested in the size of the original
structure *in the DWARF*).  This data is never thrown away, because built-in
modules can come in and out of scope, interspersed with each other and with
always-built-in kernel code, as we walk through vmlinux.o.

The third requirement, alas, cannot be avoided, but we can reduce the overhead
of it somewhat.  Rather than unconditionally doing a ctf_update() whenever we
insert a new type and whenever we insert a new structure tag (just in case this
structure refers to its own tag), we err on the side of optimism and try to
insert the member anyway, and *only if it fails* with a bad ID error do we do an
update.  The bad ID might happen because the type of a member is not known, or
because a type that member depends upon is not known: either of these might be
in the shared type repository.  Again, we err on the side of optimism and
ctf_update() the non-shared CTF file first -- it's probably much smaller and
will update faster.  Then we try adding the member again, and only if *that*
fails do we ctf_update() the share repository.  If insertion still fails after
that, it's a real error, and is treated as such.

This third requirement conflicts with the first a bit -- we have now called
ctf_update() after a ctf_snapshot() was carried out, which prevents us from
rolling back in case of error.  So we look for an ECTF_OVERROLLBACK error from
ctf_rollback(), and if one happens, we do a ctf_discard() instead, throwing away
all the types inserted after that ctf_update().  This will leave some redundant
types around (those inserted before the ctf_update()), but now they are
ctf_update()d there's no way we can get rid of them: this is only an error case
anyway.

The net effect of this is a radical speedup of dwarf2ctf: in my testing my draft
UEK4 merge tree sees this speedup:

user 563 / sys 173 / elapsed 10:09
 ->
user 272 / sys 10  / elapsed 4:56

We update the documentation at the same time.

This does change the set of dependent types, but only from one buggy set to
another (a fix to that bug is next on the agenda: the bug only affects a
limited subset of modules, which is why nobody has ever noticed it before).

Orabug: 20229506

Signed-off-by: Nick Alcock <nick.alcock@oracle.com>
Acked-by: Kris Van Hees <kris.van.hees@oracle.com>
---

diff --git a/scripts/dwarf2ctf/dwarf2ctf.c b/scripts/dwarf2ctf/dwarf2ctf.c
index 9fb8e3a39765f..b683953f8c13b 100644
--- a/scripts/dwarf2ctf/dwarf2ctf.c
+++ b/scripts/dwarf2ctf/dwarf2ctf.c
@@ -115,8 +115,22 @@ typedef struct per_module {
 	 */
 	ctf_file_t *ctf_file;
 
+	/*
+	 * A hash from a "CTF-form" structure name (in the form 's/u NAME') to
+	 * a ctf_memb_count_t (see below).
+	 */
+	GHashTable *member_counts;
 } per_module_t;
 
+/*
+ * A count associating a type ID relating to a structure or union with a count
+ * of members in that structure.
+ */
+typedef struct ctf_memb_count {
+	ctf_id_t ctf_id;
+	size_t count;
+} ctf_memb_count_t;
+
 /*
  * Get a ctf_file out of the per_module hash for a given module.
  */
@@ -613,17 +627,6 @@ static int find_ctf_encoding(struct type_encoding_tab *type_tab, size_t size);
  */
 static long count_dwarf_members(Dwarf_Die *die);
 
-/*
- * Count the number of members of a CTF aggregate.
- */
-static long count_ctf_members(ctf_file_t *fp, ctf_id_t souid);
-
-/*
- * Increment said count.
- */
-static int count_ctf_members_internal(const char *name, ctf_id_t member,
-				      ulong_t offset, void *count);
-
 /*
  * Given a DIE that may contain a type attribute, look up the target of that
  * attribute and return it, or NULL if none.
@@ -1197,6 +1200,9 @@ static ctf_file_t *init_ctf_table(const char *module_name)
 	}
 
 	new_per_mod->ctf_file = ctf_file;
+	new_per_mod->member_counts = g_hash_table_new_full(g_str_hash,
+							   g_str_equal,
+							   free, free);
 	g_hash_table_replace(per_module, xstrdup(module_name), new_per_mod);
 
 	dw_ctf_trace("Initializing module: %s\n", module_name);
@@ -2220,7 +2226,8 @@ static void detect_duplicates_alias_fixup_internal(Dwarf_Die *die,
  * Indirectly recursively called for types depending on other types, and for
  * the types of variables (which for the sake of argument we call 'types' here
  * too, since we treat them exactly like types, and dealing with types is our
- * most important function.)
+ * most important function).  In such calls, the module_name may be 'shared_ctf'
+ * if this type is in the shared CTF repository.
  */
 static ctf_full_id_t *construct_ctf_id(const char *module_name,
 				       const char *file_name,
@@ -2230,6 +2237,7 @@ static ctf_full_id_t *construct_ctf_id(const char *module_name,
 	char *id = type_id(die, NULL, NULL);
 	char *ctf_module;
 	ctf_file_t *ctf;
+	ctf_snapshot_id_t snapshot;
 
 	dw_ctf_trace("    %p: %s: looking up %s: %s\n", &id, module_name,
 		     dwarf_diename(die), id);
@@ -2279,7 +2287,7 @@ static ctf_full_id_t *construct_ctf_id(const char *module_name,
 	if (ctf == NULL) {
 		ctf = init_ctf_table(ctf_module);
 		dw_ctf_trace("%p: %s: initialized CTF file %p\n", &id,
-			     module_name, ctf);
+			     ctf_module, ctf);
 	}
 
 	/*
@@ -2293,9 +2301,11 @@ static ctf_full_id_t *construct_ctf_id(const char *module_name,
 	 * representation implicitly assumes that they cannot.)
 	 */
 
+	snapshot = ctf_snapshot(ctf);
+
 	enum skip_type skip = SKIP_CONTINUE;
 	dw_ctf_trace("%p: into die_to_ctf() for %s\n", &id, id);
-	ctf_id_t this_ctf_id = die_to_ctf(module_name, file_name, die,
+	ctf_id_t this_ctf_id = die_to_ctf(ctf_module, file_name, die,
 					  parent_die, ctf, -1, 0, 1, &skip,
 					  NULL, id);
 	dw_ctf_trace("%p: out of die_to_ctf()\n", &id);
@@ -2307,16 +2317,10 @@ static ctf_full_id_t *construct_ctf_id(const char *module_name,
 	}
 
 	if (skip != SKIP_ABORT) {
-		if (ctf_update(ctf) < 0) {
-			fprintf(stderr, "Cannot update CTF file: %s\n",
-				ctf_errmsg(ctf_errno(ctf)));
-			exit(1);
-		}
-
 		ctf_id->ctf_file = ctf;
 		ctf_id->ctf_id = this_ctf_id;
 #ifdef DEBUG
-		strcpy(ctf_id->module_name, module_name);
+		strcpy(ctf_id->module_name, ctf_module);
 		strcpy(ctf_id->file_name, file_name);
 #endif
 		g_hash_table_replace(id_to_type, id, ctf_id);
@@ -2328,14 +2332,16 @@ static ctf_full_id_t *construct_ctf_id(const char *module_name,
 		/*
 		 * Failure.  Remove the type from the id_to_type mapping, if it
 		 * is there, and discard any added types from the CTF.
+		 *
+		 * If we have had to ctf_update() due to a new type getting
+		 * used, the rollback will fail: discard instead. It might leave
+		 * some spurious types hanging around but it will clean up as
+		 * much as we can at this point.
 		 */
 
-		if (ctf_discard(ctf) < 0) {
-			fprintf(stderr, "Cannot discard from CTF file on "
-				"conversion failure or skip: %s\n",
-				ctf_errmsg(ctf_errno(ctf)));
-			exit(1);
-		}
+		if (ctf_rollback(ctf, snapshot) < 0)
+			if (ctf_errno(ctf) == ECTF_OVERROLLBACK)
+				ctf_discard(ctf);
 
 		free(ctf_id);
 		ctf_id = NULL;
@@ -2492,19 +2498,6 @@ static ctf_id_t die_to_ctf(const char *module_name, const char *file_name,
 			*ctf_id = full_ctf_id;
 
 			g_hash_table_replace(id_to_type, xstrdup(id), ctf_id);
-
-			/*
-			 * This prevents a clean rollback on error from deeply
-			 * nested types: some unreachable types may persist.
-			 * Probably unfixable wihtout a radical rewrite of
-			 * libctf (a good idea anyway, ctf_update() is terribly
-			 * slow).
-			 */
-			if (ctf_update(ctf) < 0) {
-				fprintf(stderr, "Cannot update CTF file: %s\n",
-					ctf_errmsg(ctf_errno(ctf)));
-				exit(1);
-			}
 		}
 
 		/*
@@ -3088,6 +3081,8 @@ static ctf_id_t assemble_ctf_struct_union(const char *module_name,
 
 	const char *name = dwarf_diename(die);
 	int is_union = (dwarf_tag(die) == DW_TAG_union_type);
+	ctf_memb_count_t *member_count = NULL;
+	ctf_id_t id;
 
 	/*
 	 * FIXME: these both need handling for DWARF4 support.
@@ -3100,32 +3095,52 @@ static ctf_id_t assemble_ctf_struct_union(const char *module_name,
 	 * of one with the same name and at least as many members.  If we
 	 * already know of one and it is shorter, we want to use its ID rather
 	 * than creating a new one.
+	 *
+	 * Note; by this point, the deduplicator has long run: thus we know for
+	 * sure what module a potentially-shared type will end up in, and
+	 * there's no need to double-check the shared CTF repository for types.
+	 * We also know that the module must exist in the per_module hash.
 	 */
 
 	if (name != NULL) {
-		ctf_id_t existing;
 		char *structized_name = NULL;
+		per_module_t *ctf_pm;
 
 		structized_name = str_appendn(structized_name,
-					      is_union ? "union " : "struct ",
+					      is_union ? "u " : "s ",
 					      name, NULL);
 
-		existing = ctf_lookup_by_name(ctf, structized_name);
-		free(structized_name);
+		ctf_pm = g_hash_table_lookup(per_module, module_name);
+		member_count = g_hash_table_lookup(ctf_pm->member_counts,
+						   structized_name);
 
-		if (existing >= 0) {
+		if (member_count) {
+			free(structized_name);
 			dw_ctf_trace("%s: already exists (with ID %li) with "
-				     "%li members versus current %li members\n",
-				     locerrstr, existing, count_ctf_members(ctf, existing),
+				     "%zi members versus current %li members\n",
+				     locerrstr, member_count->ctf_id,
+				     member_count->count,
 				     count_dwarf_members(die));
 
-			if (count_ctf_members(ctf, existing) <
-			    count_dwarf_members(die))
-				return existing;
+			if (member_count->count < count_dwarf_members(die))
+				return member_count->ctf_id;
 
 			*skip = SKIP_SKIP;
-			return existing;
+			return member_count->ctf_id;
+		}
+
+		/*
+		 * Not in existence yet.  Create it.
+		 */
+		member_count = malloc(sizeof(struct ctf_memb_count));
+		if (member_count == NULL) {
+			fprintf(stderr, "Out of memory allocating "
+				"structure/union member count\n");
+			exit(1);
 		}
+		member_count->count = 0;
+		g_hash_table_insert(ctf_pm->member_counts,
+				    structized_name, member_count);
 	}
 
 	dw_ctf_trace("%s: adding structure %s\n", locerrstr, name);
@@ -3134,8 +3149,13 @@ static ctf_id_t assemble_ctf_struct_union(const char *module_name,
 	else
 		ctf_add_sou = ctf_add_struct;
 
-	return ctf_add_sou(ctf, top_level_type ? CTF_ADD_ROOT : CTF_ADD_NONROOT,
-			   name);
+	id = ctf_add_sou(ctf, top_level_type ? CTF_ADD_ROOT : CTF_ADD_NONROOT,
+			 name);
+
+	if (member_count != NULL)
+		member_count->ctf_id = id;
+
+	return id;
 }
 
 /*
@@ -3161,9 +3181,32 @@ static ctf_id_t assemble_ctf_su_member(const char *module_name,
 	Dwarf_Attribute type_attr;
 	Dwarf_Die type_die;
 	Dwarf_Die cu_die;
+	ctf_memb_count_t *member_count;
+	const char *struct_name = dwarf_diename(parent_die);
 
 	CTF_DW_ENFORCE(type);
 
+	/*
+	 * Increment the member count of named structures.  This is the number
+	 * of members in the DWARF, not in the CTF: blacklisted members are
+	 * counted too.
+	 */
+	if (struct_name != NULL) {
+		int is_union = (dwarf_tag(parent_die) == DW_TAG_union_type);
+		char *structized_name = NULL;
+		per_module_t *ctf_pm;
+
+		structized_name = str_appendn(structized_name,
+					      is_union ? "u " : "s ",
+					      struct_name, NULL);
+
+		ctf_pm = g_hash_table_lookup(per_module, module_name);
+		member_count = g_hash_table_lookup(ctf_pm->member_counts,
+						   structized_name);
+		member_count->count++;
+		free(structized_name);
+	}
+
 	/*
 	 * If this member is blacklisted, just skip it.
 	 */
@@ -3364,18 +3407,61 @@ static ctf_id_t assemble_ctf_su_member(const char *module_name,
 		if (ctf_errno(ctf) == ECTF_DUPLICATE)
 			return parent_ctf_id;
 
-		if (ctf_errno(ctf) == ECTF_BADID) {
+		/*
+		 * CTF doesn't know of of either this member's type or the
+		 * enclosing structure.  Try a ctf_update() in case this is
+		 * recently added.
+		 */
+
+		if (ctf_errno(ctf) == ECTF_BADID ||
+		    ctf_errno(ctf) == ECTF_NOTSOU) {
+
+			ctf_file_t *shared_ctf;
+
+			/*
+			 * Try an update of the current CTF file first, to bring
+			 * the type ID table up to date: if that doesn't work,
+			 * try an update of the shared table.  (If none is
+			 * needed, this is cheap.)
+			 */
+
+			if (ctf_update(new_type->ctf_file) < 0) {
+				fprintf(stderr, "Cannot update CTF file: %s\n",
+					ctf_errmsg(ctf_errno(ctf)));
+				exit(1);
+			}
+
+			if (ctf_add_member_offset(ctf, parent_ctf_id,
+						  dwarf_diename(die),
+						  new_type->ctf_id,
+						  offset) == 0)
+				return parent_ctf_id;
+
+			shared_ctf = lookup_ctf_file("shared_ctf");
+			if (ctf_update(shared_ctf) < 0) {
+				fprintf(stderr, "Cannot update shared CTF: %s\n",
+					ctf_errmsg(ctf_errno(shared_ctf)));
+				exit(1);
+			}
+
+			if (ctf_add_member_offset(ctf, parent_ctf_id,
+						  dwarf_diename(die),
+						  new_type->ctf_id,
+						  offset) == 0)
+				return parent_ctf_id;
 #ifdef DEBUG
-			fprintf(stderr, "%s: Internal error: bad ID %s:%s:%p:%i "
+			fprintf(stderr, "%s: Internal error: %s %s:%s:%p:%i "
 				"on member addition to ctf_file %p.\n",
-				locerrstr, new_type->module_name,
+				locerrstr, ctf_errmsg(ctf_errno(ctf)),
+				new_type->module_name,
 				new_type->file_name, new_type->ctf_file,
 				(int) new_type->ctf_id, ctf);
 #else
-			fprintf(stderr, "%s: Internal error: bad ID %p:%i on "
+			fprintf(stderr, "%s: Internal error: %s %p:%i on "
 				"member addition to ctf_file %p.\n",
-				locerrstr, new_type->ctf_file,
-				(int) new_type->ctf_id, ctf);
+				locerrstr, ctf_errmsg(ctf_errno(ctf)),
+				new_type->ctf_file, (int) new_type->ctf_id,
+				ctf);
 #endif
 			return CTF_ERROR_REPORTED;
 		}
@@ -3494,6 +3580,12 @@ static void write_types(char *output_dir)
 
 		dw_ctf_trace("Writeout path: %s\n", path);
 
+		if (ctf_update(per_mod->ctf_file) < 0) {
+			fprintf(stderr, "Cannot serialize CTF file %s: %s\n",
+				path, ctf_errmsg(ctf_errno(per_mod->ctf_file)));
+			exit(1);
+		}
+
 		if ((fd = gzopen(path, "wb")) == NULL) {
 			fprintf(stderr, "Cannot open CTF file %s for writing: "
 				"%s\n", path, strerror(errno));
@@ -3820,30 +3912,6 @@ static long count_dwarf_members(Dwarf_Die *d)
 	exit(1);
 }
 
- /*
- * Count the number of members of a CTF aggregate.
- */
-static long count_ctf_members(ctf_file_t *fp, ctf_id_t souid)
-{
-	long count = 0;
-
-	ctf_member_iter(fp, souid, count_ctf_members_internal, &count);
-
-	return count;
-}
-
-/*
- * Increment said count.
- */
-static int count_ctf_members_internal(const char *name, ctf_id_t member,
-				      ulong_t offset, void *data)
-{
-	long *count = (long *) data;
-
-	(*count)++;
-	return 0;
-}
-
 /*
  * Free a per_module's contents.
  */
@@ -3852,6 +3920,7 @@ static void private_per_module_free(void *per_module)
 	per_module_t *per_mod = per_module;
 
 	ctf_close(per_mod->ctf_file);
+	g_hash_table_destroy(per_mod->member_counts);
 }
 
 /*