I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel, which is complete but still in the editing stage. I wrote my first novel over 20 years ago but didn't write much after that until now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

And that’s the start of another week of new paper reading — boosted 8 papers out of a total of 84 new and updated papers in the cs.CV category on arXiv.org

That feels like a whole lot of papers today … just another Monday, eh? 😛

#AI #CV #NewPapers #DeepLearning #MachineLearning

Fahim Farook

"Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training. (arXiv:2210.07688v2 [cs.CL] UPDATED)" — A study of the object hallucination problem in large-scale Vision-Language Pre-trained (VLP) models from multiple aspects.

Paper: http://arxiv.org/abs/2210.07688
Code: No code in linked repo

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Comparison of image captioning …

Fahim Farook

"Dive into Deep Learning. (arXiv:2106.11342v4 [cs.LG] UPDATED)" — An open-source book on Deep Learning based on Jupyter Notebooks so that it contains interactive examples. Freely available and well worth checking out.

Paper: http://arxiv.org/abs/2106.11342
Code: https://github.com/d2l-ai/d2l-en
Book: https://d2l.ai/

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Image of book cover for Dive in…

Fahim Farook

"Scaling Vision Transformers to 22 Billion Parameters. (arXiv:2302.05442v1 [cs.CV])" — A recipe for highly efficient and stable training of a 22B-parameter Vision Transformer (ViT), overtaking the previously known 4B-parameter model.

Paper: http://arxiv.org/abs/2302.05442

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Dense prediction from frozen Vi…

Fahim Farook

Sometimes, reading #MachineLearning papers, I find my brain turning off — I read the words, but I don’t get the meaning. I have to then stop myself and go back and read each of the words individually to make sense of them …

This probably is a sign that I need to stop reading papers 😛 But does this happen to anybody else? I think I’ve started doing this for everything, even fiction, and that’s why I can’t read as much any longer …

Fahim Farook

"Rumor Classification through a Multimodal Fusion Framework and Ensemble Learning. (arXiv:2302.05289v1 [cs.CV])" — A set of advanced image features inspired by the field of image quality assessment, used to assess message veracity in social networks within a framework that exploits all message features by exploring various machine learning models.

Paper: http://arxiv.org/abs/2302.05289

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
An overview of the proposed rum…

Fahim Farook

"Archaeological Sites Detection with a Human-AI Collaboration Workflow. (arXiv:2302.05286v1 [cs.CV])" — Using pre-trained semantic segmentation deep learning models to detect archaeological sites within the Mesopotamian floodplains environment.

Paper: http://arxiv.org/abs/2302.05286
Code: https://github.com/mister-magpie/tell_segmentation

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Investigation area. Orange dots…

Fahim Farook

"CEN-HDR: Computationally Efficient neural Network for real-time High Dynamic Range imaging. (arXiv:2302.05213v1 [cs.CV])" — A new computationally efficient neural network based on a light attention mechanism and sub-pixel convolution operations for real-time HDR imaging.

Paper: http://arxiv.org/abs/2302.05213
Code: https://github.com/steven-tel/CEN-HDR

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Qualitative comparison of the p…

Fahim Farook

"DOMINO: Domain-aware Loss for Deep Learning Calibration. (arXiv:2302.05142v1 [cs.CV])" — A domain-aware loss function to calibrate deep learning models so as to avoid the potential dangers of uncalibrated models in medical imaging.

Paper: http://arxiv.org/abs/2302.05142
Code: https://github.com/lab-smile/DOMINO

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Confusion matrices on testing s…

Fahim Farook

"Example-Based Sampling with Diffusion Models. (arXiv:2302.05116v1 [cs.GR])" — A generic way to produce 2-d point sets that imitate existing samplers, learned from observed point sets using a diffusion model. It addresses the problem of convolutional layers by leveraging neighborhood information from an optimal transport matching to a uniform grid, which allows it to benefit from fast convolutions on grids and to support the example-based learning of non-uniform sampling patterns.

Paper: http://arxiv.org/abs/2302.05116

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Learning Rank-1 realizations us…

Fahim Farook

A total of 84 papers in the cs.CV category on arXiv.org today — 53 new, 31 updated.

Don’t really feel ready for Monday … but ah, well … on to the papers!

#AI #CV #NewPapers #DeepLearning #MachineLearning

@AngelaPreston I hate when that happens. In Sri Lanka, they set off fireworks for anything — New Year, just start blasting fireworks at midnight when some of us are trying to sleep.

Your daughter got married? Set off fireworks.

Your favourite team won! Set off fireworks.

Somebody got elected to office. Cue fireworks.

It’s annoying. Sure, you have the right to celebrate whatever and however you want, but do be considerate of others too?

@at Sorry, didn’t mean that this was a feature of FB — just that the author of the app was modelling it after FB Messenger, but maybe you got that already and are simply saying that you don’t understand what it would look like?

Either way, I guess since the app is not available yet, it’s a moot point 😛

After 2 whole days of sun, the rain came back.... And this beauty as well.
Now we were both soaked, waiting for the shower to end but she gave me all the time to take a few portraits.



@at Haven’t seen anything which would re-order things … though that would be a great feature 🙂 I did see an upcoming app which allows you to use Mastodon like Facebook Messenger — organised by user rather than posts so that all posts by a given user will appear under their name and you’ll have a list of users instead of a timeline …

Other than that, the only thing I can think of is to create your own lists by subject or groups of users but you’re probably doing that already?

Some mushrooms, moss and lichen from the nearby Beaver forest. BeaverCam had only taken 9 photos, so it is staying.


Fahim Farook

The images show two separate iterations of the same app — the first is a SwiftUI app for doing #StableDiffusion image generation using #CoreML. That took several weeks of work to create and it never really worked right — I could only do single selection of images, couldn’t drag and drop more than one image, and it took a fair amount of time to implement even trivial stuff.

The second is the same app implemented using AppKit. It took essentially one day (yesterday). Multiselection was built-in, drag and drop (for multiple items) was a couple of lines of code, and I implemented a bunch of other things I wanted to do fairly easily.

On top of that, the app feels faster, lighter, and more responsive than the SwiftUI version for the exact same task, using the exact same models.

SwiftUI has a long way to go still, if it will even ever get there …

#Coding #macOS #Swift #SwiftUIvsAppKit
Screenshot of an image generati…
Screenshot of an image generati…

@AngelaPreston Not a fan of sports but I can relate to listening to the radio 🙂 When I was growing up, that used to be the only form of entertainment — going to the cinema was expensive and was a “special occasion” thing …

We’d wait the whole week for a particular radio drama to come up and so on. I still remember my Dad and his brothers crowded around the radio listening to sports events …

Another D&D session podcast released - traveling through Wildspace.

Our podcasts have shifted to more "cinematic" replays with lots of background music and sound effects.

Hope you enjoy!

https://open.spotify.com/episode/0ZmwWu4LX5QmQyVuSLIoYZ


Sportscaster: I hope you're all ready for Super Bowl Sunday.

Me: No, but I am ready for Superb Owl Sunday haha

Owl: We get one single holiday and it's just a big fucking joke to you people.
