Remix clone Hacker News

new | show | ask | jobs Github

	▲	zakeria 2 days ago
		Thanks - good question, in theory, the uGMM layer could complement CNNs in different ways - for example, one could imagine (as you mentioned): using standard convolutional layers for feature extraction, then replacing the final dense layers with uGMM neurons to enable probabilistic inference and uncertainty modeling on top of the learned features. My current focus, however, is exploring how uGMMs translate into Transformer architectures, which could open up interesting possibilities for probabilistic reasoning in attention-based models.