Posts
1411
Following
142
Followers
869
I'm a bit of an eclectic mess πŸ™‚ I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: πŸ‡±πŸ‡°πŸ‡ΈπŸ‡¦πŸ‡ΊπŸ‡ΈπŸ‡³πŸ‡ΏπŸ‡ΈπŸ‡¬πŸ‡²πŸ‡ΎπŸ‡¦πŸ‡ͺπŸ‡«πŸ‡·πŸ‡ͺπŸ‡ΈπŸ‡΅πŸ‡ΉπŸ‡ΆπŸ‡¦πŸ‡¨πŸ‡¦

Fahim Farook

"Consistency Models. (arXiv:2303.01469v1 [cs.LG])" β€” A new family of generative models that achieve high sample quality without adversarial training that supports fast one-step generation by design.

Paper: http://arxiv.org/abs/2303.01469

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Samples generated by EDM (top),…
0
1
0

Fahim Farook

"3D generation on ImageNet. (arXiv:2303.01416v1 [cs.CV])" β€” A more detailed/accurate method to generate 3D images based on 2D input images.

Paper: http://arxiv.org/abs/2303.01416

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Selected samples from EG3D (Cha…
0
0
0

Fahim Farook

"Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation. (arXiv:2303.01311v1 [cs.CV])" β€” Generating random game characters simply based on text input instead of customizing a pre-created character's visual attributes.

Paper: http://arxiv.org/abs/2303.01311

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Game characters created by the …
0
0
0

Fahim Farook

"Combining Generative Artificial Intelligence (AI) and the Internet: Heading towards Evolution or Degradation?. (arXiv:2303.01255v1 [cs.CV])" β€” Would the quality of generative AI tools be affected if the input images are generated by AI tools themselves? An initial (simulated) experiment to explore this question.

Paper: http://arxiv.org/abs/2303.01255

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Two sets of sample images β€” the…
1
1
1

Fahim Farook

"X&Fuse: Fusing Visual Information in Text-to-Image Generation. (arXiv:2303.01000v1 [cs.CV])" β€” Multiple ways to condition images prior to text-to-image generation to achieve better output results.

Paper: http://arxiv.org/abs/2303.01000

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
X&Fuse conditions the model on …
0
1
1

Fahim Farook

"Scalable Diffusion Models with Transformers. (arXiv:2212.09748v2 [cs.CV] UPDATED)" β€” Creating diffusion models that use transformers instead UNets.

Paper: http://arxiv.org/abs/2212.09748
Code: https://github.com/facebookresearch/DiT

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Diffusion models with transform…
0
0
1

Fahim Farook

"Collage Diffusion. (arXiv:2303.00262v1 [cs.CV])" β€” Creating harmonious and cohesive output images based on a text prompt and a collection of images as input.

Paper: http://arxiv.org/abs/2303.00262

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Collage Diffusion creates globa…
0
0
1

Fahim Farook

"PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing. (arXiv:2303.00137v1 [cs.CV])" β€” Generating realistic shadows and reflections using 2D images and deep learning techniques.

Paper: http://arxiv.org/abs/2303.00137

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
PixHt-Lab renders realistic ref…
0
0
0

Fahim Farook

"Magic: Multi Art Genre Intelligent Choreography Dataset and Network for 3D Dance Generation. (arXiv:2212.03741v3 [cs.CV] UPDATED)" β€” A choreography dataset and a network for generating 3D dance segments based on a music clip as input.

Paper: http://arxiv.org/abs/2212.03741

Note: v3 of the paper is currently not available in PDF form on arXiv.

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
A conceptual overview of Magic.…
0
1
0

Fahim Farook

"Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning. (arXiv:2302.14794v1 [cs.CV])" β€” Rather than using a frozen language model to communicate visual concepts, this method uses a meta -mapper to act as a bridge between large-scale visiona and language models.

Paper: http://arxiv.org/abs/2302.14794

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Multimodal few-shot meta-learni…
0
0
1

Fahim Farook

"TextIR: A Simple Framework for Text-based Editable Image Restoration. (arXiv:2302.14736v1 [cs.CV])" β€” Using text input to restore damaged images by specifying how to fill in the damage areas by way of text descriptions.

Paper: http://arxiv.org/abs/2302.14736

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Overview of image restoration r…
0
0
0

Fahim Farook

"Towards Enhanced Controllability of Diffusion Models. (arXiv:2302.14368v1 [cs.CV])" β€” Creating a diffusion model that is easier to edit/style based on input images by conditioning the model on a spatial content mask and a flattened style embedding.

Paper: http://arxiv.org/abs/2302.14368

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Comparison of the proposed mode…
0
0
0

Fahim Farook

"One-Shot Video Inpainting. (arXiv:2302.14362v1 [cs.CV])" β€” A method to inpaint videos where instead of having to provide masks for each frame, you only need to provide the object mask for the initial frame in the video sequence.

Paper: http://arxiv.org/abs/2302.14362

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Qualitative comparison between …
0
1
1

Fahim Farook

"Deep Learning for Identifying Iran's Cultural Heritage Buildings in Need of Conservation Using Image Classification and Grad-CAM. (arXiv:2302.14354v1 [cs.CV])" β€” Using machine learning to identify damage and defects to cultural heritage buildings using Convolutional Neural Networks (CNN).

Paper: http://arxiv.org/abs/2302.14354

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Comparing a picture with small-…
0
0
1

Fahim Farook

"Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based Image Generation Systems. (arXiv:2301.01914v2 [cs.CV] UPDATED)" β€” Comparing the accuracy and fidelity of images generated by DALL-E 2 and Luna, which is Stable Diffusion-based.

Paper: http://arxiv.org/abs/2301.01914
Luna code: https://github.com/slowy07/luna

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Selected image samples from the…
0
0
1

Fahim Farook

"Diffusion Posterior Sampling for General Noisy Inverse Problems. (arXiv:2209.14687v3 [stat.ML] UPDATED)" β€” Extending diffusion solvers to efficiently handle general noisy (non)linear inverse problems via approximation of the posterior sampling.

Paper: http://arxiv.org/abs/2209.14687
Code: https://github.com/dps2022/diffusion-posterior-sampling

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Solving noisy linear, and nonli…
0
0
1

Fahim Farook

"Subspace Diffusion Generative Models. (arXiv:2205.01490v2 [cs.LG] UPDATED)" β€” Restricting diffusion via projections onto subspaces to reduce computational time and cost without affecting the overall quality of the generated image.

Paper: http://arxiv.org/abs/2205.01490
Code: https://github.com/bjing2016/subspace-diffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Random high resolution samples …
0
0
0

Fahim Farook

"Large Scale Visual Food Recognition. (arXiv:2103.16107v3 [cs.CV] UPDATED)" β€” A food dataset with 2,000 categories and over 1 million images that can be used for food recognition.

Paper: http://arxiv.org/abs/2103.16107
Code: https://github.com/Liuyuxinict/prenet/

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
The distributions over each cat…
0
1
1

Fahim Farook

"Directed Diffusion: Direct Control of Object Placement through Attention Guidance. (arXiv:2302.13153v1 [cs.CV])" β€” Controlling object placement in diffusion models by way of attention guidance.

Paper: http://arxiv.org/abs/2302.13153

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Directed Diffusion (DD) key res…
0
0
1

Fahim Farook

"In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages" β€” Measuring the formality of the generated text for different languages using multilingual generative language models.

Paper: https://arxiv.org/abs/2302.12299

#AI #NewPaper #DeepLearning #MachineLearning #Language

<<Find this useful? Please boost so that others can benefit too πŸ™‚>>
Differences between formal and …
0
0
1
Show older