Add `get_model_state` to get validated doc. #309

mnbbrown · 2025-10-07T10:13:14Z

Add get_model_state to Doc

This adds get_model_state, which returns the entire doc state as the
pydantic model passed in as Model= during instantiation.

It's effectively the other side of apply_update.

If the doc has no Model defined it will raise a RuntimeError.
If the doc is invalid it will raise a pydantic ValidationError.

Add to_py to Doc
get_model_state uses a new to_py for Doc (1aa8dfc).
It iterates through the roots of the Doc, converting them into their
native python types (using the underlying type's to_py() fn).
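
As a rough illustration of the flow described above (not pycrdt's actual implementation), here is a stdlib-only sketch. `ToyRoot`, `ToyDoc`, and `ToyModel` are hypothetical names invented for the example: each root exposes `to_py()`, the doc collects them into a dict, and `get_model_state` validates that dict. A dataclass stands in for the pydantic model, so an invalid state raises `ValueError` here rather than pydantic's `ValidationError`.

```python
from dataclasses import dataclass
from typing import Any


class ToyRoot:
    """Hypothetical stand-in for a shared type that can export itself."""

    def __init__(self, value: Any) -> None:
        self._value = value

    def to_py(self) -> Any:
        return self._value


@dataclass
class ToyModel:
    """Hypothetical stand-in for a pydantic model."""

    title: str
    tags: list

    def __post_init__(self) -> None:
        if not isinstance(self.title, str):
            raise ValueError("title must be a str")
        if not isinstance(self.tags, list):
            raise ValueError("tags must be a list")

    @classmethod
    def model_validate(cls, data: dict) -> "ToyModel":
        # Mirrors the name of pydantic's entry point; here it just builds the dataclass.
        return cls(**data)


class ToyDoc:
    def __init__(self, roots: dict, Model: Any = None) -> None:
        self._roots = roots
        self._Model = Model

    def to_py(self) -> dict:
        # Convert every root to its native Python value.
        return {name: root.to_py() for name, root in self._roots.items()}

    def get_model_state(self) -> Any:
        if self._Model is None:
            raise RuntimeError("no Model defined for doc")
        # Validate the whole exported state and return the model instance.
        return self._Model.model_validate(self.to_py())


doc = ToyDoc({"title": ToyRoot("notes"), "tags": ToyRoot(["a", "b"])}, Model=ToyModel)
state = doc.get_model_state()
```

The sketch keeps conversion (`to_py`) and validation (`get_model_state`) as separate steps, which is the split the description above outlines.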

additional changes
Also, update apply_update to use model_validate instead of
Model(**value). This is more of a "style" thing.
See pydantic/pydantic#9676

Also add ruff to the test dependencies and format the code, given its config was already
defined in pyproject.toml.


mnbbrown · 2025-10-07T10:18:32Z

python/pycrdt/_doc.py

            d = {k: twin_doc[k].to_py() for k in self._Model.model_fields}
            try:
-                self._Model(**d)
+                self._Model.model_validate(d)


This is the style change I refer to in the PR description.

mnbbrown · 2025-10-07T15:53:16Z

@davidbrochart This actually doesn't work in its current iteration because _roots gets updated lazily when calling get(). Have you had any thoughts about how _roots could be kept in sync? I think the options are either making _roots look at the rust state directly, or updating _roots when running apply_update.

Any thoughts?
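
To make the problem concrete, here is a toy model of it. `ToyStore` and `ToyDoc` are hypothetical names: they only mimic the shape of a lazily filled `_roots` cache sitting in front of a backing store, and are not pycrdt's internals. An update that creates a root nobody has called `get()` on is invisible through the cache, while reading the backing store directly sees it.

```python
class ToyStore:
    """Hypothetical stand-in for the state held on the Rust side."""

    def __init__(self) -> None:
        self.data: dict = {}


class ToyDoc:
    def __init__(self) -> None:
        self._store = ToyStore()
        self._roots: dict = {}  # filled lazily, only by get()

    def get(self, name: str) -> list:
        # First access registers the root in the cache.
        if name not in self._roots:
            self._roots[name] = self._store.data.setdefault(name, [])
        return self._roots[name]

    def apply_update(self, update: dict) -> None:
        # Writes go to the backing store; the cache is not touched.
        for name, items in update.items():
            self._store.data.setdefault(name, []).extend(items)

    def cached_state(self) -> dict:
        # Sees only roots that get() has already been called on.
        return dict(self._roots)

    def store_state(self) -> dict:
        # Reads the backing store directly, so no root is missed.
        return dict(self._store.data)


doc = ToyDoc()
doc.get("todo")
doc.apply_update({"todo": ["write tests"], "done": ["open PR"]})
stale = doc.cached_state()  # lacks the "done" root
fresh = doc.store_state()   # contains both roots
```

In this toy, `store_state` corresponds to the "look at the rust state directly" option; the alternative would be to have `apply_update` register new roots in `_roots` as it goes.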


davidbrochart · 2025-10-08T13:52:45Z

Sorry @mnbbrown I'm a bit low on bandwidth currently, but excited to see what you're doing.

mnbbrown · 2025-10-08T15:48:09Z

> Sorry @mnbbrown I'm a bit low on bandwidth currently, but excited to see what you're doing.

No worries! I know how it is :)

I think this is ready for a review now - I hope the description is sufficiently clear. Let me know if anything else needs explanation.

davidbrochart

Thanks @mnbbrown, I left some minor changes and I have questions.
I'm wondering if get_model_state could be supported without enabling validation in apply_update, which is costly because it requires a "twin doc". Maybe a parameter to Doc like validate_updates=False. Maybe in the future a validate_changes=True parameter could validate changes before applying them to the document too.
Also, you may want to look at type annotations if you don't need validation, which works at static type analysis and at run-time.
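
One possible shape for the flag suggested above, as a stdlib-only sketch. Everything here is hypothetical: `ToyDoc` is not pycrdt's `Doc`, `require_int_count` stands in for a pydantic model, and a merged dict copy stands in for the costlier "twin doc". The point is only that on-demand validation in `get_model_state` can be decoupled from per-update validation in `apply_update`.

```python
from typing import Any, Callable, Optional


class ToyDoc:
    def __init__(
        self,
        Model: Optional[Callable[[dict], Any]] = None,
        validate_updates: bool = False,  # hypothetical flag from the review
    ) -> None:
        self._Model = Model
        self._validate_updates = validate_updates
        self._state: dict = {}

    def apply_update(self, update: dict) -> None:
        if self._Model is not None and self._validate_updates:
            # Validate a merged copy first so a bad update never reaches the doc.
            self._Model({**self._state, **update})
        self._state.update(update)

    def get_model_state(self) -> Any:
        if self._Model is None:
            raise RuntimeError("Document has no model")
        # Validation happens here regardless of validate_updates.
        return self._Model(dict(self._state))


def require_int_count(state: dict) -> dict:
    """Hypothetical validator standing in for a pydantic model."""
    if not isinstance(state.get("count"), int):
        raise ValueError("count must be an int")
    return state


lazy = ToyDoc(Model=require_int_count)
lazy.apply_update({"count": "oops"})  # accepted: no per-update validation

strict = ToyDoc(Model=require_int_count, validate_updates=True)
```

With the flag off, an invalid update is accepted and only surfaces when `get_model_state` is called; with it on, the update is rejected before it is applied.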

davidbrochart · 2025-10-09T06:58:13Z

python/pycrdt/_doc.py

        skip_gc: bool | None = None,
        doc: _Doc | None = None,
-        Model=None,
+        Model: Any | None = None,


Suggested change

-        Model: Any | None = None,
+        Model: Any = None,

davidbrochart · 2025-10-09T07:01:24Z

python/pycrdt/_doc.py

+    def get_model_state(self) -> Any:
+        if self._Model is None:
+            raise RuntimeError(
+                "no Model defined for doc. Instantiate Doc with Doc(Model=PydanticModel)"


Suggested change

-                "no Model defined for doc. Instantiate Doc with Doc(Model=PydanticModel)"
+                "Document has no model"

davidbrochart · 2025-10-09T07:09:46Z

tests/test_model.py

+    )
+
+
+def test_model_no_model_defined():


Suggested change

-def test_model_no_model_defined():
+def test_model_not_defined():

davidbrochart · 2025-10-09T07:10:16Z

tests/test_model.py

+    with pytest.raises(RuntimeError) as exc_info:
+        local_doc.get_model_state()
+
+    assert str(exc_info.value).startswith("no Model defined for doc")


Suggested change

-    assert str(exc_info.value).startswith("no Model defined for doc")
+    assert str(exc_info.value).startswith("Document has no model")

davidbrochart · 2025-10-09T07:19:31Z

python/pycrdt/_doc.py

+            )
+        with self.transaction() as txn:
+            assert txn._txn is not None
+            all_roots = self._doc.to_py(txn._txn)


I'm wondering why you don't do the same as in Doc.apply_update?:

d = {k: self._doc[k].to_py() for k in self._Model.model_fields}
self._Model.model_validate(d)

davidbrochart · 2025-10-09T07:22:08Z

src/doc.rs

        result.into()
    }

+    fn to_py(&self, py: Python<'_>, txn: &mut Transaction) -> PyResult<Py<PyAny>> {


Does this convert nested shared data to Python too?

davidbrochart · 2025-10-09T07:24:11Z

pyproject.toml

    "mypy",
    "coverage[toml] >=7",
    "exceptiongroup; python_version<'3.11'",
+    "ruff>=0.13.3",


Since you're introducing ruff, maybe we should use pre-commit to check and lint?

mnbbrown added 2 commits October 7, 2025 11:06

Use timezone.utc instead of datetime.UTC

6443b94
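
For context on this commit: `datetime.UTC` is an alias for `timezone.utc` that was only added in Python 3.11, so the older spelling works on earlier interpreters as well. Presumably that is the motivation here, though the commit message does not say so. A minimal example:

```python
from datetime import datetime, timezone

# timezone.utc predates datetime.UTC, which is an alias for it
# introduced in Python 3.11.
now = datetime.now(timezone.utc)
```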

Add test where model not defined.

663d169

mnbbrown marked this pull request as draft October 7, 2025 10:30

Implement to_py for Doc.

1aa8dfc

This iterates through the roots of the Doc, converting them into their native python types (using the underlying type's to_py() fn). tbd if this can be used to replace `_roots` as well?

mnbbrown force-pushed the pydantic-model-validation branch from 5cf8926 to 1aa8dfc Compare October 8, 2025 13:24

mnbbrown marked this pull request as ready for review October 8, 2025 15:45
