Posts
1407
Following
142
Followers
868
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

And that’s it for papers today — 6 papers boosted from the cs.CV category (and one outside) out of a total of 98 on arXiv.org today ...

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
1
1

Fahim Farook

"Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based Image Generation Systems. (arXiv:2301.01914v2 [cs.CV] UPDATED)" — Comparing the accuracy and fidelity of images generated by DALL-E 2 and Luna, which is Stable Diffusion-based.

Paper: http://arxiv.org/abs/2301.01914
Luna code: https://github.com/slowy07/luna

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Selected image samples from the…
0
0
1

Fahim Farook

"An efficient deep neural network to find small objects in large 3D images. (arXiv:2210.08645v2 [cs.CV] UPDATED)" — What it says in the title 🙂

Paper: http://arxiv.org/abs/2210.08645
Code: https://github.com/nyukat/3d_gmic

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
0
1
1

Fahim Farook

"Leveraging Large Language Model and Story-Based Gamification in Intelligent Tutoring System to Scaffold Introductory Programming Courses: A Design-Based Research Study" — Using Large Language Models (LLM) and gamification to teach programming in a more digestible format (especially) in introductory programming courses.

Paper: https://arxiv.org/abs/2302.12834

#AI #NewPaper #DeepLearning #MachineLearning #HumanComputerInteraction

<<Find this useful? Please boost so that others can benefit too 🙂>>
0
1
3

Fahim Farook

"Diffusion Posterior Sampling for General Noisy Inverse Problems. (arXiv:2209.14687v3 [stat.ML] UPDATED)" — Extending diffusion solvers to efficiently handle general noisy (non)linear inverse problems via approximation of the posterior sampling.

Paper: http://arxiv.org/abs/2209.14687
Code: https://github.com/dps2022/diffusion-posterior-sampling

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Solving noisy linear, and nonli…
0
0
1

Fahim Farook

"Subspace Diffusion Generative Models. (arXiv:2205.01490v2 [cs.LG] UPDATED)" — Restricting diffusion via projections onto subspaces to reduce computational time and cost without affecting the overall quality of the generated image.

Paper: http://arxiv.org/abs/2205.01490
Code: https://github.com/bjing2016/subspace-diffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Random high resolution samples …
0
0
0

Fahim Farook

"Large Scale Visual Food Recognition. (arXiv:2103.16107v3 [cs.CV] UPDATED)" — A food dataset with 2,000 categories and over 1 million images that can be used for food recognition.

Paper: http://arxiv.org/abs/2103.16107
Code: https://github.com/Liuyuxinict/prenet/

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The distributions over each cat…
0
1
1

Fahim Farook

"Directed Diffusion: Direct Control of Object Placement through Attention Guidance. (arXiv:2302.13153v1 [cs.CV])" — Controlling object placement in diffusion models by way of attention guidance.

Paper: http://arxiv.org/abs/2302.13153

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Directed Diffusion (DD) key res…
0
0
1

Fahim Farook

A total of 98 papers in the cs.CV category on arXiv.org today — 35 new, 63 updated.

Gentlepeople, start your engines 🙂

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
1

Fahim Farook

Call me contrary, but when I see people saying “I want to work at <Apple/Google/Microsoft>”, I ask, “Why?”

Somebody at work told me recently that Apple would want to hire me because I did/knew something only like 10 people in the world could do and 7 of them worked at Apple. So I was like, “If 7 of them are at Apple, why would Apple want to hire me?”

Plus, I don’t want to work at Apple 😛

I’ve been at Apple as a visitor and seen what the culture is like, and I really don’t want to work at that kind of place. Plus, I really like working from home and based on what I saw (and what I’ve read since the pandemic) Apple really does not seem to like people working from home. Based on what I saw, that tallies.

So why would I work for a company which has diametrically opposed viewpoints to mine? Just because it says “something” about me? Or because I think they might pay me a lot better and money fixes everything?

I do think that if I did that, it would say something about me, but not the same “something” that others are thinking of. Personally, I’d think that I put other people’s opinion of what I am over my own comfort. And I don’t want to do that …

#Work #Companies #Opinions
0
0
0

Fahim Farook

Boosted 3 papers in the cs.CV category (and one outside) out of a total of 49 new and updated papers on arXiv.org today.

On to other things now …

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

Fahim Farook

"In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages" — Measuring the formality of the generated text for different languages using multilingual generative language models.

Paper: https://arxiv.org/abs/2302.12299

#AI #NewPaper #DeepLearning #MachineLearning #Language

<<Find this useful? Please boost so that others can benefit too 🙂>>
Differences between formal and …
0
0
1

Fahim Farook

"ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation. (arXiv:2209.04145v6 [cs.CV] UPDATED)" — Using 2D images as a stepping stone for creating 3D shapes and eliminating the need for paired text-shape data.

Paper: http://arxiv.org/abs/2209.04145
Code: https://github.com/liuzhengzhe/ISS-Image-as-Stepping-Stone-for-Text-Guided-3D-Shape-Generation

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Our novel “Image as Stepping St…
0
1
2

Fahim Farook

"Modulating Pretrained Diffusion Models for Multimodal Image Synthesis. (arXiv:2302.12764v1 [cs.CV])" — Multimodal Conditioning Modules (MCM) for enabling conditional image synthesis using pretrained diffusion models so that you can generate images using not just a text prompt, but additional input such as a segmentation map or a sketch.

Paper: http://arxiv.org/abs/2302.12764

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Multimodal conditioning modules…
0
1
1

Fahim Farook

"Surface Recognition for e-Scooter Using Smartphone IMU Sensor. (arXiv:2302.12720v1 [eess.SP])" — Detecting whether an e-scooter is on a paved road or a sidewalk using the Inertial Measurement Unit (IMU) sensors on a smartphone.

Paper: http://arxiv.org/abs/2302.12720

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Left: an example of a street wi…
0
1
0

Fahim Farook

We (the wife and I) like playing console games — we especially like co-op games that we can both play at once. But we hate split-screen co-op since we get confused as to who is on which part of the screen 😛

So we are kind of limited on the games we can play. Because of that, when we have nothing else to play, we do try single-player games …

Yesterday was one such day where we tried a bunch of single-player games on PS5 to see if we can find something that we liked.

First up was, “Outriders” — the graphics looked great, it was a science-fiction storyline and so we thought we might like it. Unfortunately, it’s a mostly “shooting” game. You have to keep shooting people to advance and we don’t like that kind of game much …

Sure, we don’t mind if there’s some combat, but we prefer a storyline, puzzles, exploration, that kind of thing …

So, we tried “ReadySet Heroes” next. This one was co-op, but split screen. Given that we didn’t have a lot of choices, we decided to give it a try. But unfortunately, it was just dungeon crawling. No story, nothing of interest unless you just like mindlessly bashing things. Next!

The last one we tried was “Omno” and here we hit the Goldilocks zone … almost 😛 It was single-player, but it wasn’t frantic shooting/button mashing. It had a gentle, explore-at-your-pace kind of gameplay and lots of puzzles. Sure, we had to pass the controller back and forth between us but we still enjoyed it way more than any of the others and that’s the one we stuck with for the rest of the day 🙂

#Gaming #PS5 #CoOpPlay
0
0
1

Fahim Farook

A total of 49 papers in the cs.CV category on arXiv.org today — 34 new, 15 updated.

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

One of the best consequences of my switch to has been exposure to people with hobbies and interests different from mine. By following other instances or particular hashtags I get to learn about and learn from people and communities I would otherwise not have encountered.

0
3
0

American Robin loaf

0
1
0
Show older