ENH: add dtype_from_format option to preserve Excel text formatting #63037

mina1957 · 2025-11-07T22:08:55Z

closes ENH: Columns formatted as "Text" in Excel are read as numbers #61539
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

rhshadrach

Thanks for the PR. Overall, this seems far too complex an implementation for the feature being implemented. Why can we not just determine the format upon reading each cell and react appropriately?

In addition, it does not seem to me pandas should ever read a cell as numeric if that cell is text. I think we should not implement this as a flag.
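
For illustration, a minimal sketch of the per-cell approach suggested above. It is not the PR's code or pandas' actual reader logic. `FakeCell` and `convert_cell` are hypothetical names, and the sketch assumes the engine exposes a `number_format` string on each cell, with `"@"` being Excel's built-in "Text" format code.

```python
from dataclasses import dataclass
from typing import Any, Optional

# Excel's built-in number format code for "Text".
TEXT_FORMAT = "@"


@dataclass
class FakeCell:
    """Hypothetical stand-in for an engine's cell object (assumption)."""

    value: Any
    number_format: Optional[str] = "General"


def convert_cell(cell: FakeCell) -> Any:
    """Return the cell value, keeping text-formatted cells as strings."""
    if cell.value is None:
        # Empty cells stay missing regardless of their format.
        return None
    if cell.number_format == TEXT_FORMAT:
        value = cell.value
        # Illustrative choice: render integral floats without ".0" so a
        # numeric 123.0 stored under a Text format reads as "123".
        if isinstance(value, float) and value.is_integer():
            value = int(value)
        return str(value)
    # Any other format: leave the value as the engine parsed it.
    return cell.value
```

With this shape the decision is made once per cell at read time, so no flag and no post-hoc conversion of index labels would be needed; whether every engine can supply the format string is a separate question.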

rhshadrach · 2025-11-11T19:04:20Z

pandas/io/excel/_base.py

+                ordered_levels.append(level_idx)
+        return ordered_levels
+
+    def _convert_index_labels(self, index, levels_to_convert: list[int]):


It looks like code got duplicated?

rhshadrach · 2025-11-11T19:04:37Z

pandas/io/excel/_base.py

+                ordered_levels.append(level_idx)
+        return ordered_levels
+
+    def _convert_index_labels(self, index, levels_to_convert: list[int]):


This looks the same?

rhshadrach · 2025-11-11T19:07:29Z

doc/source/user_guide/io.rst

+This behavior currently applies to the ``openpyxl`` and ``xlrd`` engines. Other
+engines simply ignore the flag until text format detection is implemented for
+them.


I'm negative on diverging behavior between readers unless absolutely necessary.

rhshadrach · 2025-11-11T19:22:38Z

pandas/io/excel/_base.py

+    @staticmethod
+    def _parser_engine(parser):
+        return getattr(parser, "_engine", parser)
+
+    @classmethod
+    def _parser_attr(cls, parser, attribute: str):
+        if hasattr(parser, attribute):
+            return getattr(parser, attribute)
+        engine = cls._parser_engine(parser)
+        if engine is not parser and hasattr(engine, attribute):
+            return getattr(engine, attribute)
+        return None


Why are these necessary?

rhshadrach · 2025-11-11T19:26:02Z

pandas/io/excel/_openpyxl.py


+    @staticmethod
+    def _cell_is_text_formatted(cell) -> bool:
+        number_format = getattr(cell, "number_format", None)


When does cell have and not have the number_format attribute?

mina1957 added 3 commits November 7, 2025 16:44

ENH: add dtype_from_format option to preserve Excel text formatting

b6e4e02

ENH: add dtype_from_format option to preserve Excel text formatting

5e7496c

ran ruff

83ec944

mina1957 requested a review from rhshadrach as a code owner November 7, 2025 22:08

rhshadrach requested changes Nov 11, 2025

rhshadrach added the IO Excel read_excel, to_excel label Nov 11, 2025
