Posts: 1639 | Following: 139 | Followers: 885
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel, which is complete but still in the editing stage. I wrote my first novel over 20 years ago but then didn't write much until now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models. (arXiv:2302.08453v1 [cs.CV])" - Controlling text-to-image diffusion models in a more granular fashion by using special adapters to provide extra guidance.

Paper: http://arxiv.org/abs/2302.08453
Code: https://github.com/TencentARC/T2I-Adapter

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>

Fahim Farook

"MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation. (arXiv:2302.08113v1 [cs.CV])" - Controlling diffusion-based image generation so that you can specify image components, component placement etc. without any further fine-tuning.

Paper: http://arxiv.org/abs/2302.08113
Code: https://github.com/omerbt/MultiDiffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>

Fahim Farook

"À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting. (arXiv:2302.07994v1 [cs.LG])" - Having multiple subsets of data trained on specific prompts and being able to compose the final model based on the prompts you select.

Paper: http://arxiv.org/abs/2302.07994

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>

Fahim Farook

"PRedItOR: Text Guided Image Editing with Diffusion Prior. (arXiv:2302.07979v1 [cs.CV])" - Structure-preserving, text-guided image editing using diffusion models without needing a base prompt, fine-tuning of models etc.

Paper: http://arxiv.org/abs/2302.07979

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>

@memetzgz If I wanted to read all of them, yes 😛 But I just skim through the summaries to find the ones that look interesting … At the moment, I’ve found 5 interesting papers and have whittled down the 93 to 39 remaining papers …

I saw this little guy yesterday and I can't stop thinking about him. So round. So pink.

(It's an Australian pink robin and the photos are not mine.)

Credits:
1. © Deepak Karra http://instagram.com/ravi_arora/
2. © Ravi Arora http://instagram.com/ravi_arora/
3. © Ambika Angela Bone http://instagram.com/ambikangela/
4. © Tim J. Hopwood http://flickr.com/photos/timjhopwood/


Fahim Farook

A total of 93 papers in the cs.CV category on arXiv.org today: 61 new, 32 updated going into the weekend …

So yesterday’s low paper count was definitely not a slowdown 😛

#AI #CV #NewPapers #DeepLearning #MachineLearning

Fahim Farook

Updated Akkoma on our server to the latest version. New graphs, yay!! Hopefully nothing broke, though; I’m always scared of stuff breaking after an update since I’m always in a rush …

I ended up testing our new Neural Telephoto feature by shooting with iPhone SE for a while. I loved the shots I got out of it w/ native RAW and the extra reach of the virtual 2×.

Death Valley / Owens Valley, California
iPhone SE, 1× / 2× (Neural Telephoto), @halide RAW


I was quietly minding my own business taking photos of Bullfinches (small/far away) in a tree when this Robin came and perched right in front of my camera and demanded to have his picture taken. How could I refuse?

🤣



The passionflower, a large and beautiful flower


The single-digit temperatures and a bit of sunshine created an ethereal ground fog this morning.

📆 Feb 16, 2023
📷 1/1600 s at f/11, ISO 180, 170mm


The Adam optimizer is at the heart of modern AI. Researchers have been trying to dethrone Adam for years.

How about we ask a machine to do a better job? @googleai uses evolution to discover a simpler and more efficient algorithm with remarkable features.

It’s just 8 lines of code: 🧵


@ajyoung On Apple Silicon, the initial model load times are way longer for SPLIT_EINSUM compiled models than they are for ORIGINAL models. The EINSUM ones sometimes take about 2 minutes to load while the ORIGINAL ones load in about 10 - 20 seconds at most.

Of course, some of this also depends on how you load the model. Once loaded, the initial image takes about 2 seconds longer to generate, but subsequent images at 10 - 20 steps are very fast. If you go over 20 steps on the DPM-Solver++, though, it takes much longer. I think I had one on 50 steps which never completed, and after about 3 - 5 minutes I just cancelled it …

So there are a bunch of factors at play, and it also depends on how you load the models. I find the Swift apps the easiest since you just load the model and then don’t unload it till you quit the app 🙂

quick thread, retweets of which would be most welcome. just got off the line with a recent CS/math grad and Ukrainian woman who just got laid off seven months after starting her career in tech. she's on her own here and supporting her family there.


Redoing the introduction after moving server.

I'm Elias (he/him)

I run a small regenerative farm called Viriditas in (I understand Catalan, I speak Spanish).

I'm interested in regenerative agriculture, self organisation, federated food security.

Especially looking for other regenerative farmers producing in dry climates to discuss best practices.


@simonemargio 😛 I’m sorry you had to go through that … I really wish that certain things were better designed instead of just kind of put together as we go along …

@simonemargio Yeah, that’s what it sounds like … What language/platform were you using?

Fahim Farook

I don’t know if this is the same on other platforms/languages, but #Apple #Swift support for image metadata retrieval seems to be rather pitiful 😛 I tried several suggested approaches and libraries, and all I get is around 4 - 5 metadata items for a particular image.

I use exiftool (https://exiftool.org/) and I get around 20+ metadata items for the same file.

So I resorted to this exiftool wrapper: https://github.com/hlemai/ExifTool It is supposed to use the exiftool libraries, but even that got only 18 items and left out the one item I was actually interested in 😛

So I finally ended up writing some custom code which runs exiftool locally as a task, captures the output of the command, and then parses that output to get the metadata. That finally worked for me …

But should I have to jump through all these hoops to get image metadata when using Swift? I would have thought there was a simpler/easier solution.

Does anyone know of one?

I’m almost tempted to write my own solution in Swift where I read the raw image data, parse the header and get the metadata. But do I really want to? Probably not …

#Apple #Swift #Images #Metadata #Coding
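
For anyone curious what the "run exiftool as a task and parse the output" approach from the post above might look like, here is a minimal sketch. It is not the code from the post. It assumes exiftool is installed at /usr/local/bin/exiftool (adjust the path for your machine) and that it is asked for JSON output via exiftool's -json option, which may differ from the output format that was actually parsed. The names readMetadata, parseExifToolJSON and ExifToolError are made up for this illustration.

```swift
import Foundation

/// Hypothetical error type for this sketch.
enum ExifToolError: Error {
    case nonZeroExit(Int32)
    case unexpectedOutput
}

/// Parses the output of `exiftool -json`, which is a JSON array
/// holding one object of tag/value pairs per input file.
/// Returns the tags of the first (here: only) file.
func parseExifToolJSON(_ data: Data) throws -> [String: Any] {
    let json = try JSONSerialization.jsonObject(with: data)
    guard let files = json as? [[String: Any]], let first = files.first else {
        throw ExifToolError.unexpectedOutput
    }
    return first
}

/// Runs exiftool as a child process on a single image and returns its tags.
/// `exifToolPath` is an assumption; point it at your own exiftool install.
func readMetadata(of imagePath: String,
                  exifToolPath: String = "/usr/local/bin/exiftool") throws -> [String: Any] {
    let process = Process()
    process.executableURL = URL(fileURLWithPath: exifToolPath)
    process.arguments = ["-json", imagePath]

    let stdout = Pipe()
    process.standardOutput = stdout
    // Discard stderr so an unread pipe can never block the child process.
    process.standardError = FileHandle.nullDevice

    try process.run()
    // Read everything before waiting, so a large output cannot fill the pipe.
    let data = stdout.fileHandleForReading.readDataToEndOfFile()
    process.waitUntilExit()

    guard process.terminationStatus == 0 else {
        throw ExifToolError.nonZeroExit(process.terminationStatus)
    }
    return try parseExifToolJSON(data)
}
```

Calling readMetadata(of:) with the path to a photo should give back a dictionary keyed by exiftool's tag names, so the one item you care about can be looked up by name. Keeping the parsing in its own function means it can be checked against a saved exiftool output without launching a process.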