Skip to content

Preparing Biomedical Multimodal (Text+Images/Tables/Formulas) Dataset for Gemma Fine-tuning #210

Open
@virologist

Description

@virologist

Hi,@gemma team

I’m working on a biomedical/medical domain fine-tuning dataset converted from PDFs to Markdown. The content contains:
• Images (medical diagrams, charts)
• Complex tables (clinical data)
• Mathematical formulas (drug dosage equations, statistical models)
Could the Gemma team advise:
1. Best practices for handling multimodal elements in Markdown format during dataset preparation?
2. Recommended preprocessing steps for preserving semantic relationships between text & non-text elements?
3. Any existing tutorials/documentation for similar technical/academic domain fine-tuning?

Any help would be greatly appreciated!
Yang

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions