Obtain Character-Level Bounding Boxes of Generated Text #3921

enchainingrealm · 2019-06-27T18:26:12Z

I'm placing some text on an image as follows, using Python 3.7.3 and PIL 5.4.1:

from PIL import Image, ImageDraw, ImageFont

image = ...
font_filepath = ...
font_size = ...

draw = ImageDraw.Draw(image)
font = ImageFont.truetype(font_filepath, font_size)

xy = ...   # generate random (x, y) coordinates to place the text at
text = ...

draw.text(xy, text, font=font)

I want to get the character-level bounding boxes around the text placed on the image. For example, if text = "hello", I want a list of five rectangles, each of which bounds a corresponding letter in "hello". For example, the bounding box for "l" should be thinner and taller than the bounding box for "o", and the bounding boxes for the two "l"s should have different x-positions and same y-positions.

I have investigated using:

size = font.getsize(text)
mask = font.getmask(text)

However, I don't know how to interpret mask, because:

mask is an ImagingCore object instead of an Image object
len(mask) does not even equal the area calculated by size[0] * size[1]

What is the easiest way to obtain character-level bounding boxes for text placed on an image by PIL?

The text was updated successfully, but these errors were encountered:

radarhere · 2019-06-28T21:53:11Z

from PIL import Image, ImageDraw, ImageFont

image = Image.new("RGB", (200, 100))
font_filepath = "/Library/Fonts/Arial.ttf"
font_size = 50

draw = ImageDraw.Draw(image)
font = ImageFont.truetype(font_filepath, font_size)

xy = (50, 20)
text = "hello"

draw.text(xy, text, font=font)

for char in text:
	print(font.getmask(char).size)

radarhere · 2019-07-03T10:37:24Z

If it helps, here is where 'size' is defined for ImagingCore -

Pillow/src/_imaging.c

Line 3388 in 292b4d0

{ "size", (getter) _getattr_size },

Let us know if this doesn't answer your question, or if you have any further questions.

enchainingrealm · 2019-07-04T18:04:05Z

This gives me the size of each character's bounding box, but how do I retrieve the location of each character's bounding box?

radarhere · 2019-07-04T20:07:31Z

from PIL import Image, ImageDraw, ImageFont

image = Image.new("RGB", (200, 100))
font_filepath = "/Library/Fonts/Arial.ttf"
font_size = 50

draw = ImageDraw.Draw(image)
font = ImageFont.truetype(font_filepath, font_size)

xy = (40, 20)
text = "hello"

draw.text(xy, text, font=font)

for i, char in enumerate(text):
	right, bottom = font.getsize(text[:i+1])
	width, height = font.getmask(char).size
	right += xy[0]
	bottom += xy[1]
	top = bottom - height
	left = right - width
	
	draw.rectangle((left, top, right, bottom), None, "#f00")

image.save("out.png")

enchainingrealm · 2019-07-04T21:04:53Z

That makes sense, seems like you can just assume the boxes for each character are adjacent to each other and aligned along the bottom.

That answers my question. I'm closing this issue now.

lamhoangtung · 2019-09-19T11:18:59Z

Since @radarhere solution is incorrect with multiple words like this:

I slightly modified it so that it can works on even more cases:

from PIL import Image, ImageDraw, ImageFont

image = Image.new("RGB", (500, 100))
font_filepath = "./template/arial.ttf"
font_size = 50

draw = ImageDraw.Draw(image)
font = ImageFont.truetype(font_filepath, font_size)

xy = (40, 20)
text = "Hoàng Tùng Lâm"

draw.text(xy, text, font=font)

for i, char in enumerate(text):
    bottom_1 = font.getsize(text[i])[1]
    right, bottom_2 = font.getsize(text[:i+1])
    bottom = bottom_1 if bottom_1 < bottom_2 else bottom_2
    width, height = font.getmask(char).size
    right += xy[0]
    bottom += xy[1]
    top = bottom - height
    left = right - width

    draw.rectangle((left, top, right, bottom), None, "#f00")

    draw.rectangle((left, top, right, bottom), None, "#f00")

image.save("out.png")

Hope this could help someone :P

indigoviolet · 2020-07-19T22:26:47Z

The above is close, but not always correct for slanted fonts: see #4789 (comment)

jinyu121 · 2021-07-18T02:56:56Z

@lamhoangtung 's solution is great, but when processing Chinese punctuations, this code can not always get "tight" bounding box. For example:

The font is msyhbd.ttf, and the text is text = "Hoàng Tùng Lâm 测试，测试。测试！测试？Test. Test?"

Is this caused by the font file itself?

lamhoangtung · 2021-07-18T02:58:55Z

@jinyu121 It's likely due to the font itself

radarhere added the Question label Jun 28, 2019

enchainingrealm closed this as completed Jul 4, 2019

nulano mentioned this issue Jun 23, 2020

Implement anchor for truetype text functions #4724

Closed

6 tasks

indigoviolet mentioned this issue Jul 16, 2020

Character bounding boxes and negative offsetx #4789

Closed

This was referenced Oct 9, 2020

Add getlength and getbbox functions for TrueType fonts #4959

Merged

Release notes for TrueType functions and PyPy support #4969

Merged

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Obtain Character-Level Bounding Boxes of Generated Text #3921

Obtain Character-Level Bounding Boxes of Generated Text #3921

enchainingrealm commented Jun 27, 2019 •

edited by radarhere

Loading

radarhere commented Jun 28, 2019

radarhere commented Jul 3, 2019

enchainingrealm commented Jul 4, 2019

radarhere commented Jul 4, 2019

enchainingrealm commented Jul 4, 2019

lamhoangtung commented Sep 19, 2019 •

edited

Loading

indigoviolet commented Jul 19, 2020

jinyu121 commented Jul 18, 2021

lamhoangtung commented Jul 18, 2021

Obtain Character-Level Bounding Boxes of Generated Text #3921

Obtain Character-Level Bounding Boxes of Generated Text #3921

Comments

enchainingrealm commented Jun 27, 2019 • edited by radarhere Loading

radarhere commented Jun 28, 2019

radarhere commented Jul 3, 2019

enchainingrealm commented Jul 4, 2019

radarhere commented Jul 4, 2019

enchainingrealm commented Jul 4, 2019

lamhoangtung commented Sep 19, 2019 • edited Loading

indigoviolet commented Jul 19, 2020

jinyu121 commented Jul 18, 2021

lamhoangtung commented Jul 18, 2021

enchainingrealm commented Jun 27, 2019 •

edited by radarhere

Loading

lamhoangtung commented Sep 19, 2019 •

edited

Loading