
gaussian_nll_loss doesn't work on GPU #632

@pkese

Description

if ((variance < 0).any().item<bool>()) throw new ArgumentException("variance has negative entry/entries");

I had to change

if ((variance < 0).any().item<bool>())
    throw new ArgumentException("variance has negative entry/entries");

into

if ((variance < 0).any().to(DeviceType.CPU).item<bool>())
    throw new ArgumentException("variance has negative entry/entries");

in order to make it work on a GPU, because item() cannot be extracted from a tensor unless the tensor resides on the CPU.

However, looking at this code, I think the whole check is unnecessary (if variance is less than zero, it gets clamped to eps on the next line anyway) as well as slow, since it forces a GPU synchronization and a copy of data back to the CPU, which adds I/O latency.

Also

variance = variance.clone().maximum(torch.tensor(eps))

causes an unnecessary clone of the tensor (maximum returns a new tensor anyway) and an extra tensor allocation for eps.
I suggest replacing it with:

variance = variance.clamp_min(eps);
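
For context, here is a minimal sketch of how the variance handling could look with both suggestions applied, written against TorchSharp's tensor API. The surrounding function, its name, and its default parameters are illustrative, and the loss expression is just the standard 0.5 * (log(var) + (input - target)^2 / var) form, not the library's actual implementation:

using System;
using static TorchSharp.torch;

static Tensor GaussianNllSketch(Tensor input, Tensor target, Tensor variance,
                                bool full = false, double eps = 1e-6)
{
    // clamp_min bounds every element below by eps, so negative or zero variances
    // become eps without a GPU -> CPU synchronization and without an explicit
    // clone (clamp_min returns a new tensor).
    variance = variance.clamp_min(eps);

    // Standard per-element Gaussian negative log-likelihood, reduced by mean.
    var loss = (variance.log() + (input - target).pow(2) / variance).mul(0.5);
    if (full)
        loss = loss.add(0.5 * Math.Log(2 * Math.PI));
    return loss.mean();
}

Since clamp_min runs entirely on the tensor's own device, nothing has to be copied back to the host and no synchronization point is introduced.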
