Skip to content

ankitagrahari/MorphousAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MorphousAI

MorphousAI is a simple and intuitive AI application that lets you upload images, generate images from text, and convert text to audio — all powered by OpenAI APIs.

This project showcases how you can build a multi-modal AI tool using OpenAI capabilities to handle image generation, text-to-image, and text-to-speech workflows. GitHub


🚀 Features

✔️ Upload an image and generate outputs
✔️ Convert user-entered text into another image
✔️ Convert text to natural-sounding audio
✔️ Uses OpenAI APIs for generative models
✔️ Easy to extend & customize


🧠 How It Works

This project combines several AI modalities:

  1. Image Upload Module

    • Allows users to upload an image from their device.
  2. Text-to-Image Conversion

    • Uses OpenAI’s image generation APIs to create images from prompts.
  3. Text-to-Audio Conversion

    • Uses OpenAI’s text-to-speech APIs to synthesize speech from text.

Developers can extend or modify this to include additional AI workflows like speech-to-text, image captioning, or more advanced multi-modal interactions.


🛠️ Tech Stack

📌 Java (primary codebase)
📌 Spring AI to integrate with OpenAI (for image & audio generation)
📌 Vaadin - Web frontend for uploads & interaction

🔧 Prerequisites

Before running the project locally, make sure you have:


📝 Setup & Installation

  1. Clone the repository
git clone https://github.com/ankitagrahari/MorphousAI.git
cd MorphousAI
  1. Install dependencies
mvn clean install
  1. Configure your environment
Create a .env file in the root:

OPENAI_API_KEY=<your_openai_api_key_here>

⚠️ Make sure not to commit your .env file or API key to GitHub.

4.	Start the app
mvn spring-boot:run -Dspring-boot.run.arguments="--server.port=9090"

Your app should now be running locally.

📌 Usage

✏️ Text to Image

1.	Enter a text prompt.
2.	The app sends the prompt to the OpenAI image API.
3.	View the generated image in the UI.

🔉 Text to Audio

1.	Enter text you want to hear.
2.	The app uses OpenAI’s text-to-speech API.
3.	Download/play the generated audio.

📷 Image Upload

1.	Upload an image from your device.
2.	Extend this module to perform image captioning, text extraction, or further generative steps.

🧪 Example Screenshots

Screenshot 2025-12-16 at 7 35 34 AM Screenshot 2025-12-16 at 7 35 59 AM Screenshot 2025-12-16 at 7 37 12 AM Screenshot 2025-12-16 at 7 39 36 AM

🧠 How to Contribute

Contributions are welcome! You can help by: • Adding new AI features (speech-to-text, image captioning) • Improving UI/UX • Writing tests or bug fixes • Updating documentation

To contribute:

git fork https://github.com/ankitagrahari/MorphousAI git clone <your_forked_repo> git checkout -b feature/my-awesome-feature

Submit a pull request and describe your changes.

📄 License

Distributed under the MIT License. See LICENSE for more information.

❤️ Acknowledgements

Thanks to: • OpenAI for powerful generative models 🌟 • ✨ The developer community for inspiration

📞 Contact

If you have questions, feel free to reach out:

Ankit Agrahari – 💬 GitHub: https://github.com/ankitagrahari 📎 Email: (ankitagrahari.rkgit@gmail.com)

🎉 Happy building with MorphousAI!

About

Helps upload image, convert text to image and text to audio using OpenAi

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages