[Bug]: the fit_html (and probably cleaned_html) generator removes rowspan/colspan from tables, leading to incorrect markdown formatting and errors in LLM extraction results #1920

@pbtsrc

crawl4ai version

0.8.5

Expected Behavior

The rowspan and colspan attributes on table cells should be kept in fit_html and cleaned_html.

Current Behavior

The rowspan and colspan attributes are removed from table cells in fit_html and cleaned_html.
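
To illustrate why the missing attributes matter downstream, here is a minimal sketch using only the Python standard library. It does not call crawl4ai and is not how crawl4ai builds its output; the sample table and the names TableGrid, to_grid and strip_spans are made up for this illustration. It expands a table into a grid the way a markdown converter would need to, once with the span attributes present and once after stripping them.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample table, invented for this illustration.
WITH_SPANS = """
<table>
<tr><th rowspan="2">Region</th><th colspan="2">Sales</th></tr>
<tr><th>Q1</th><th>Q2</th></tr>
<tr><td>North</td><td>10</td><td>20</td></tr>
</table>
"""


class TableGrid(HTMLParser):
    """Collect table cells into rows, expanding rowspan/colspan."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell texts
        self.pending = {}   # column index -> (rows left, text) for rowspans
        self.row = None     # row currently being built
        self.cell = None    # (text parts, rowspan, colspan) of the open cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.row = []
        elif tag in ("td", "th"):
            a = dict(attrs)
            self.cell = ([], int(a.get("rowspan", 1)), int(a.get("colspan", 1)))

    def handle_data(self, data):
        if self.cell is not None:
            self.cell[0].append(data)

    def _fill_pending(self):
        # Copy cells that span down from earlier rows into this row.
        while len(self.row) in self.pending:
            col = len(self.row)
            left, text = self.pending[col]
            self.row.append(text)
            if left > 1:
                self.pending[col] = (left - 1, text)
            else:
                del self.pending[col]

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self.cell is not None:
            parts, rowspan, colspan = self.cell
            text = "".join(parts).strip()
            self._fill_pending()
            for _ in range(colspan):
                if rowspan > 1:
                    self.pending[len(self.row)] = (rowspan - 1, text)
                self.row.append(text)
            self.cell = None
        elif tag == "tr" and self.row is not None:
            self._fill_pending()
            self.rows.append(self.row)
            self.row = None


def to_grid(html):
    """Return the table in `html` as a list of rows."""
    parser = TableGrid()
    parser.feed(html)
    return parser.rows


def strip_spans(html):
    """Mimic the reported behavior by dropping rowspan/colspan attributes."""
    return re.sub(r'\s(?:rowspan|colspan)="\d+"', "", html)


for label, html in (("with spans", WITH_SPANS), ("stripped", strip_spans(WITH_SPANS))):
    print(label)
    for row in to_grid(html):
        print("  | " + " | ".join(row) + " |")
```

With the attributes present, every row expands to the same number of columns. Once they are stripped, the header rows come out narrower than the data row, so header cells no longer line up with the values beneath them, which is the kind of misaligned markdown table that then confuses LLM extraction.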

Is this reproducible?

Yes

Inputs Causing the Bug

Steps to Reproduce

Code snippets

OS

Linux

Python version

3.13

Browser

No response

Browser version

No response

Error logs & Screenshots (if applicable)

No response

Labels

🐞 Bug (Something isn't working), 🩺 Needs Triage (Needs attention of maintainers)